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INTRACELLULAR SIGNALING MOLECULES 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of intracellular signaling 
5 molecules and to the use of these sequences in the diagnosis, treatment, and prevention of cell 
proliferative, autoimmune/inflammatory, neurological, gastrointestinal, reproductive, and 
developmental disorders. 



BACKGROUND OF THE INVENTION 

10 Cell-cell communication is essential for the growth, development, and survival of 

multicellular organisms. Cells communicate by sending and receiving molecular signals. An 
example of a molecular signal is a growth factor, which binds and activates a specific transmembrane 
receptor on the surface of a target cell. The activated receptor transduces the signal intracellularly, 
thus initiating a cascade of biochemical reactions that ultimately affect gene transcription and cell 

15 cycle progression in the target cell. 

Intracellular signaling is the process by which cells respond to extracellular signals 
(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of a signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in the 

20 process involve the activation of various cytoplasmic proteins by phosphorylation via protein kinases, 
and their deactivation by protein phosphatases, and the eventual translocation of some of these 
activated proteins to the cell nucleus where the transcription of specific genes is triggered. The 
intracellular signaling process regulates all types of cell functions including cell proliferation, cell 
differentiation, and gene transcription, and involves a diversity of molecules including protein 

25 kinases and phosphatases, and second messenger molecules such as cyclic nucleotides, calcium- 
calmodulin, inositol, and various mitogens that regulate protein phosphorylation. 

Intracellular signaling is carried out by a variety of molecules that promote the transduction 
and amplification of the signal. For example, binding of a ligand to a transmembrane receptor 
activates membrane-associated intracellular proteins, such as G-proteins. G-proteins mediate both the 

30 level of intracellular second messengers, such as cyclic AMP, and the activity of signaling enzymes, 
such as phospholipase C. These messengers and enzymes then activate signal transduction pathways, 
many of which are mediated by protein kinase cascades. Phosphorylation of proteins in response to 
extracellular signals, cell cycle checkpoints, and environmental or nutritional stresses is often 
accomplished by transfer of a high energy phosphate from ATP. Second messengers whose effects 

35 are mediated by protein kinases include cyclic AMP, cyclic GMP, inositol triphosphate, cyclic ADP 



1 



WO 00/77040 PCT/US00/16636 

ribose, and calcium/calmodulin. Alternatively, binding of ligand to a transmembrane receptor, such 
as a receptor tyrosine kinase, triggers the activation of a molecular "switch," such as a monomelic 
GTPase. In this case, binding of ligand to the receptor activates a catalytic domain in the intracellular 
portion of the receptor. This activated domain then switches on the activity of monomelic GTPases 
5 such as Ras, usually via adaptor proteins. 

Cells also respond to changing conditions by switching off signals. Many signal transduction 
proteins are short-lived and rapidly targeted for degradation by covalent ligation to ubiquitin, a highly 
conserved small protein. Cells also maintain mechanisms to monitor changes in the concentration of 
denatured or unfolded proteins in membrane-bound extracytoplasmic compartments, including a 
10 transmembrane receptor that monitors the concentration of available chaperone molecules in the 
endoplasmic reticulum and transmits a signal to the cytosol to activate the transcription of nuclear 
genes encoding chaperones in the endoplasmic reticulum. 

Certain proteins in intracellular signaling pathways serve to link or cluster other proteins 
involved in the signaling cascade. These proteins are referred to as scaffold, anchoring, or adaptor 
15 proteins. (For review, see Pawson, T., and Scott, J.D. (1997) Science 278:2075-2080.) As many 
intracellular signaling proteins such as protein kinases and phosphatases have relatively broad 
substrate specificities, the adaptors help to organize the component signaling proteins into specific 
biocehmical pathways. 

Gangliosides, generally associated with plasma membranes, also participate in signal 
20 transduction. Aberrant ganglioside function has been implicated in inflammatory and degenerative 
diseases within and outside of the nervous system, including Tay-Sachs disease, multiple sclerosis, 
lupus erythematosus, and insulin-dependent diabetes mellitus (Misasi, R. et al. (1997) Diabetes 
Metab.Rev. 13:163-179). 

Many of the above signaling molecules are characterized by the presence of particular 
25 domains that promote protein-protein interactions. A sampling of these domains is discussed below, 
along with other important intracellular messengers. 

Intracellular Signaling Second Messenger Molecules 

Phospholipid and Inositol-phosphate Signaling 

30 Inositol phospholipids (phosphoinositides) are involved in an intracellular signaling pathway 

that begins with binding of a signaling molecule to a G-protein linked receptor in the plasma 
membrane. This leads to the phosphorylation of phosphatidylinositol (PI) residues on the inner side 
of the plasma membrane to the biphosphate state (PIP 2 ) by inositol kinases. Simultaneously, the G- 
protein linked receptor binding stimulates a trimeric G-protein which in turn activates a 

35 phosphoinositide-specific phospholipase C-p. Phospholipase C-P then cleaves PIP 2 into two 
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products, inositol triphosphate (IP 3 ) and diacylglyceroi. These two products act as mediators for 
separate signaling events. IP 3 diffuses through the plasma membrane to induce calcium release from 
the endoplasmic reticulum (ER), while diaacylglycerol remains in the membrane and helps activate 
protein kinase C, an STK that phosphorylates selected proteins in the target cell. The calcium 
5 response initiated by IP 3 is terminated by the dephosphorylation of IP 3 by specific inositol 

phosphatases. Cellular responses that are mediated by this pathway are glycogen breakdown in the 
liver in response to vasopressin, smooth muscle contraction in response to acetylcholine, and 
thrombin-induced platelet aggregation. 
Cyclic Nucleotide Signaling 

10 Cyclic nucleotides (cAMP and cGMP) function as intracellular second messengers to 

transduce a variety of extracellular signals including hormones, light, and neurotransmitters. In 
particular , cyclic-AMP dependent protein kinases (PKA) are thought to account for all of the effects 
of cAMP in most mammalian cells, including various hormone-induced cellular responses. Visual 
excitation and the phototransmission of light signals in the eye is controlled by cyclic-GMP 

15 regulated, Ca 2+ -specific channels. Because of the importance of cellular levels of cyclic nucleotides 
in mediating these various responses, regulating the synthesis and breakdown of cyclic nucleotides is 
an important matter. Thus adenylyl cyclase, which synthesizes cAMP from AMP, is activated to 
increase cAMP levels in muscle by binding of adrenaline to p-andrenergic receptors, while activation 
of guanylate cyclase and increased cGMP levels in photoreceptors leads to reopening of the 

20 Ca 2+ -specific channels and recovery of the dark state in the eye. In contrast, hydrolysis of cyclic 

nucleotides by cAMP and cGMP-specific phosphodiesterases (PDEs) produces the opposite of these 
and other effects mediated by increased cyclic nucleotide levels. PDEs appear to be particularly 
important in the regulation of cyclic nucleotides, considering the diversity found in this family of 
proteins. At least seven families of mammalian PDEs (PDE 1-7) have been identified based on 

25 substrate specificity and affinity, sensitivity to cofactors, and sensitivity to inhibitory drugs (Beavo, 
J.A. (1995) Physiological Reviews 75:725-48). PDE inhibitors have been found to be particularly 
useful in treating various clinical disorders. Rolipram, a specific inhibitor of PDE4, has been used in 
the treatment of depression, and similar inhibitors are undergoing evaluation as anti-inflammatory 
agents. Theophylline is a nonspecific PDE inhibitor used in the treatment of bronchial asthma and 

30 other respiratory diseases (Banner, K.H. and Page, CP. (1995) Eur. Respir. J. 8:996-1000). 
Calcium Signaling Molecules 

Ca +2 is another second messenger molecule that is even more widely used as an intracellular 
mediator than cAMP. Two pathways exist by which Ca +2 can enter the cytosol in response to 
extracellular signals: One pathway acts primarily in nerve signal transduction where Ca +2 enters a 

35 nerve terminal through a voltage-gated Ca +2 channel. The second is a more ubiquitous pathway in 



3 



WO 00/77040 PCT/US00/16d36 

which Ca +2 is released from the ER into the cytosol in response to binding of an extracellular 
signaling molecule to a receptor. Ca 2 * directly activates regulatory enzymes, such as protein kinase 
C, which trigger signal transduction pathways. Ca 2+ also binds to specific Ca 2+ -binding proteins 
(CBPs) such as calmodulin (CaM) which then activate multiple target proteins in the cell including 

5 enzymes, membrane transport pumps, and ion channels. CaM interactions are involved in a 

multitude of cellular processes including, but not limited to, gene regulation, DNA synthesis, cell 
cycle progression, mitosis, cytokinesis, cytoskeletal organization, muscle contraction, signal 
transduction, ion homeostasis, exocytosis, and metabolic regulation (Celio, M.R. et al. (1996) 
Guidebook to Calcium-binding Proteins , Oxford University Press, Oxford, UK, pp. 15-20). Some 

10 Ca 2+ binding proteins are characterized by the presence of one or more EF-hand Ca 2+ binding motifs, 
which are comprised of 12 amino acids flanked by a-helices (Celio, supra ). The regulation of CBPs 
has implications for the control of a variety of disorders. Calcineurin, a CaM-regulated protein 
phosphatase, is a target for inhibition by the immunosuppressive agents cyclosporin and FK506. This 
indicates the importance of calcineurin and CaM in the immune response and immune disorders 

15 (Schwaninger M. et al. ( 1993) J. Biol Chem. 268:23 1 1 1-23 1 15). The level of CaM is increased 

several-fold in tumors and tumor-derived cell lines for various types of cancer (Rasmussen, CD. and 
Means, A.R. (1989) Trends in Neuroscience 12:433-438). 

The annexins are a family of calcium-binding proteins that associate with the cell membrane 
(Towle, C.A. and Treadwell, B.V. (1992) J. Biol. Chem. 267:5416-23). Annexins reversibly bind to 

20 negatively charged phospholipids (phosphatidylcholine and phosphatidylserine) in a calcium 

dependent manner. Annexins participate in various processes pertaining to signal transduction at the 
plasma membrane, including membrane-cytoskeleton interactions, phospholipase inhibition, 
anticoagulation, and membrane fusion. Annexins contain four to eight repeated segments of about 6G 
residues. Each repeat folds into five alpha helices wound into a right-handed superhelix. 

25 Signaling Complex Protein Domains 

PDZ domains were named for three proteins in which this domain was initially discovered. 
These proteins include PSD-95 (postsynaptic density 95), Dig ( Drosophila lethal(l)discs large-1), and 
ZO-1 (zonula occludens-1). These proteins play important roles in neuronal synaptic transmission, 
tumor suppression, and cell junction formation, respectively. Since the discovery of these proteins, 

30 over sixty additional PDZ-containing proteins have been identified in diverse prokaryotic and 

eukaryotic organisms. This domain has been implicated in receptor and ion channel clustering and in 
the targeting of multiprotein signaling complexes to specialized functional regions of the cytosolic 
face of the plasma membrane. (For review of PDZ domain-containing proteins, see Ponting, C P. et 
al. (1997) Bioessays 19:469-479.) A large proportion of PDZ domains are found in the eukaryotic 

35 MAGUK (membrane-associated guanylate kinase) protein family, members of which bind to the 
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intracellular domains of receptors and channels. However. PDZ domains are also found in diverse 
membrane-localized proteins such as protein tyrosine phosphatases, serine/threonine kinases, G- 
protein cofactors, and synapse-associated proteins such as syntrophins and neuronal nitric oxide 
synthase (nNOS). Generally, about one to three PDZ domains are found in a given protein, although 

5 up to nine PDZ domains have been identified in a single protein. The glutamate receptor interacting 
protein (GRIP) contains seven PDZ domains. GRIP is an adaptor that links certain glutamate 
receptors to other proteins and may be responsible for the clustering of these receptors at excitatory 
synapses in the brain (Dong, H. et ah (1997) Nature 386:279-284). 

The SH3 domain is defined by homology to a region of the proto-oncogene c-Src, a 

10 cytoplasmic protein tyrosine kinase. SH3 is a small domain of 50 to 60 amino acids that interacts 
with proline-rich ligands. SH3 domains are found in a variety of eukaryotic proteins involved in 
signal transduction, cell polarization, and membrane-cytoskeleton interactions. In some cases, SH3 
domain-containing proteins interact directly with receptor tyrosine kinases. For example, the SLAP- 
130 protein is a substrate of the T-cell receptor (TCR) stimulated protein kinase. SLAP- 130 interacts 

15 via its SH3 domain with the protein SLP-76 to affect the TCR-induced expression of interleukin-2 
(Musci, M.A. et al. (1997) J. Biol. Chem. 272:11674-1 1677). Another recently identified SH3 
domain protein is macrophage actin-associated tyrosine-phosphorylated protein (MAYP) which is 
phosphorylated during the response of macrophages to colony stimulating factor- 1 (CSF-1) and is 
likely to play a role in regulating the CSF-1 -induced reorganization of the actin cytoskeleton (Yeung, 

20 Y.-G. et al. (1998) J. Biol. Chem. 273:30638-30642). The structure of SH3 is characterized by two 
antiparallel beta sheets packed against each other at right angles. This packing forms a hydrophobic 
pocket lined with residues that are highly conserved between different SH3 domains. This pocket 
makes critical hydrophobic contacts with proline residues in the ligand (Feng, S. et al. (1994) Science 
266: 1241-47). Endophilin is an SH3 domain-containing protein implicated in synaptic vesicle 

25 endocytosis. (Micheva, K.D. (1997) 272:27239-27245). 

A novel domain, called the WW domain, resembles the SH3 domain in its ability to bind 
proline-rich ligands. This domain was originally discovered in dystrophin, a cytoskeletal protein with 
direct involvement in Duchenne muscular dystrophy (Bork, P. and Sudol, M. (1994) Trends Biochem. 
Sci. 19:531-533). WW domains have since been discovered in a variety of intracellular signaling 

30 molecules involved in development, cell differentiation, and cell proliferation. The structure of the 
WW domain is composed of beta strands grouped around four conserved aromatic residues, generally 
tryptophan. 

Like SH3, the SH2 domain is defined by homology to a region of c-Src. SH2 domains 
interact directly with phospho-tyrosine residues, thus providing an immediate mechanism for the 
35 regulation and transduction of receptor tyrosine kinase-mediated signaling pathways. For example, as 
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many as ten distinct SH2 domains are capable of binding to phosphorylated tyrosine residues in the 
activated PDGF receptor, thereby providing a highly coordinated and finely tuned response to ligand- 
mediated receptor activation. (Reviewed in Schaffhausen, B. (1995) Biochem. Biophys. Acta. 
1242:61-75.) 

5 Homer is a neuronal immediate early gene that is enriched at excitatory synapses (Xiao, B. et 

al. (1998) Neuron 21:707-716). Homer proteins form multivalent complexes that bind proline-rich 
motifs in group 1 metabotropic glutamate receptors and inositol triphosphate receptors, thereby 
coupling these receptors in a signaling complex (Tu, J.C. (1999) Neuron 23:583-592). 

The pleckstrin homology (PH) domain was originally identified in pleckstrin, the 

10 predominant substrate for protein kinase C in platelets. Since its discovery, this domain has been 

identified in over 90 proteins involved in intracellular signaling or cytoskeletal organization. Proteins 
containing the pleckstrin homology domain include a variety of kinases, phospholipase-C isoforms, 
guanine nucleotide release factors, and GTPase activating proteins. For example, members of the 
FGD1 family contain both Rho-guanine nucleotide exchange factor (GEF) and PH domains, as well 

15 as a FYVE zinc finger domain. FGD1 is the gene responsible for faciogenital dysplasia, an inherited 
skeletal dysplasia (Pasteris, N.G. and Gorski, J.L. (1999) Genomics 60:57-66). Many PH domain 
proteins function in association with the plasma membrane, and this association appears to be 
mediated by the PH domain itself. PH domains share a common structure composed of two 
antiparallel beta sheets flanked by an amphipathic alpha helix. Variable loops connecting the 

20 component beta strands generally occur within a positively charged environment and may function as 
ligand binding sites (Lemmon, M. A. et al. (1996) Cell 85:621-624.). n-Chimaerin is a GAP 
involved in the formation of lamellipodia and filopodia in neuroblastoma cells. (Kozma, R. et al. 
(1996) Mol. Cell Biol. 16:5069-5080.) 

Ankyrin (ANK) repeats mediate protein-protein interactions associated with diverse 

25 intracellular signaling functions. For example, ANK repeats are found in proteins involved in cell 
proliferation such as kinases, kinase inhibitors, tumor suppressors, and cell cycle control proteins. 
(See, for example, Kalus, W. et al. (1997) FEBS Lett. 401:127-132; Ferrante, A. W. et al. (1995) 
Proc. Natl. Acad. Sci. USA 92: 191 1-1915.) These proteins generally contain multiple ANK repeats, 
each composed of about 33 amino acids. Myotrophin is an ANK repeat protein that plays a key role 

30 in the development of cardiac hypertrophy, a contributing factor to many heart diseases. Structural 
studies show that the myotrophin ANK repeats, like other ANK repeats, each form a helix-turn-helix 
core preceded by a protruding "tip." These tips are of variable sequence and may play a role in 
protein-protein interactions. The helix-turn-helix region of the ANK repeats stack on top of one 
another and are stabilized by hydrophobic interactions (Yang, Y. et al. (1998) Structure 6:619-626). 

35 The tetratrico peptide repeat (TPR) is a 34 amino acid repeated motif found in organisms 
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from bacteria to humans. TPRs are predicted to form ampipathic helices, and appear to mediate 
protein-protein interactions. TPR domains are found in CDC 16, CDC23, and CDC27, members the 
the anaphase promoting complex which targets proteins for degradation at the onset of anaphase. 
Other processes involving TPR proteins include cell cycle control, transcription repression, stress 

5 response, and protein kinase inhibition. (Lamb, J.R. et al. (1995) Trends Biochem. Sci. 20:257-259.) 

The armadillo/beta-catenin repeat is a 42 amino acid motif which forms a superhelix of alpha 
helices when tandemly repeated. The structure of the armadillo repeat region from beta-catenin 
revealed a shallow groove of positive charge on one face of the superhelix, which is a potential 
binding surface. The armadillo repeats of beta-catenin, plakoglobin, and $120°** bind the cytoplasmic 

10 domains of cadherins. Beta-catenin/cadherin complexes are targets of regulatory signals that govern 
cell adhesion and mobility. (Huber, A.H. et al. (1997) Cell 90:871-882.) 

The discovery of new intracellular signaling proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cell proliferative, autoimmune/inflammatory, reproductive, and 

15 developmental disorders. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, intracellular signaling molecules, referred to 
collectively as "INTRA" and individually as "INTRA- 1," "INTRA-2," "INTRA-3," "INTRA-4," 

20 "INTRA-5," "INTRA-6," "EMTRA-7," "INTRA-8," "INTRA-9," "INTRA-10," "INTRA- 1 1," 

"INTRA-12," "INTRA-13," "INTRA-14," "INTRA-15," "INTRA-16," "INTRA-17," "INTRA-18," 
"INTRA-19," "INTRA-20," "INTRA-21," "INTRA-22," "INTRA-23," "INTRA-24," "INTRA-25," 
"INTRA-26," "INTRA-27," "INTRA-28," "INTRA-29," "INTRA-30," "INTRA-31," "INTRA-32," 
"ESfTRA-33," "INTRA-34," "INTRA-35," 'TNTRA-36," "INTRA-37," "INTRA-38," "INTRA-39," 

25 "INTRA-40," "INTRA-4 1," "INTRA-42," "INTRA-43," "INTRA-44," "ENTRAPS , M "INTRA-46," 
"INTRA-47," "INTRA-48," "INTRA-49," "INTRA-50," "INTRA-Sl," and "INTRA-52." In one 
aspect, the invention provides an isolated polypeptide comprising an amino acid sequence selected 
from the group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-52, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 

30 amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, and d) 
an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ED 
NO: 1-52. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1-52. 

35 The invention further provides an isolated polynucleotide encoding a polypeptide comprising 
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an amino acid sequence selected from the group consisting of a) an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-52, b) a naturally occurring amino acid sequence having 
at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-52, c) a biologically active fragment of an amino acid sequence selected from the group 

5 consisting of SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-52. In one alternative, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-52. In another alternative, the 
polynucleotide is selected from the group consisting of SEQ ID NO:53-104. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 

10 sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-52, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 

15 SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-52. In one alternative, the invention provides a cell transformed 
with the recombinant polynucleotide. In another alternative, the invention provides a transgenic 
organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 

20 sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-52, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino acid sequence selected from the 

25 group consisting of SEQ ID NO: 1-52. The method comprises a) culturing a cell under conditions 
suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant 
polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 
polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 

30 polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino 

35 acid sequence selected from the group consisting of SEQ ID NO: 1-52. 
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The invention further provides an isolated polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:53-104, b) a naturally occurring polynucleotide sequence having at least 
90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
5 NO:53-104, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 

10 sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:53-104, b) a naturally occurring polynucleotide sequence having at least 
90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:53-104, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 

15 sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 

complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the 

20 probe comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ED NO:53-104, b) a naturally occurring polynucleotide sequence having at least 

25 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:53-104, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 

30 optionally, if present, the amount thereof. 

The invention further provides a pharmaceutical composition comprising an effective amount 
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 

35 the group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino acid sequence 
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selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-52, and a pharmaceutical^ 
acceptable excipient. In one embodiment, the pharmaceutical composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-52. The invention additionally 
5 provides a method of treating a disease or condition associated with decreased expression of 
functional INTRA, comprising administering to a patient in need of such treatment the 
pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
10 an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52. The 
15 method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an agonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional INTRA, comprising administering to a 
20 patient in need of such treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
25 selected from the group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
30 composition comprising an antagonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with overexpression of functional INTRA, comprising administering to a patient 
in need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 
35 to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
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amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic fragment of an amino 
5 acid sequence selected from the group consisting of SEQ ID NO: 1-52. The method comprises a) 
combining the polypeptide with at least one test compound under suitable conditions, and b) 
detecting binding of the polypeptide to the test compound, thereby identifying a compound that 
specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 

10 activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-52, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-52, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-52, and d) an immunogenic 

15 fragment of an amino acid sequence selected from the group consisting of SEQ ED NO: 1-52. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 

20 change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:53-104, the method comprising a) 

25 exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 



BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
30 clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding INTRA. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of INTRA. 
Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
35 expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
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disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding INTRA were isolated. 
5 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 

polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

DESCRIPTION OF THE INVENTION 

10 Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which 
will be limited only by the appended claims. 

15 It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 

and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

20 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 

25 the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"INTRA" refers to the amino acid sequences of substantially purified INTRA obtained from 
30 any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
INTRA. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of INTRA either by directly interacting with 
35 INTRA or by acting on components of the biological pathway in which INTRA participates. 
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An "allelic variant" is an alternative form of the gene encoding INTRA. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
5 allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding INTRA include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as INTRA or a 

10 polypeptide with at least one functional characteristic of INTRA. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding INTRA, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
INTRA. The encoded protein may also be "altered," and may contain deletions, insertions, or 

15 substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent INTRA. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of INTRA is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 

20 amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 

25 polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 
"Amplification" relates to the production of additional copies of a nucleic acid sequence. 

30 Amplification is generally carried out using polymerase chain reaction (PGR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of INTRA. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of INTRA either by 
35 directly interacting with INTRA or by acting on components of the biological pathway in which 
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INTRA participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind INTRA polypeptides can be prepared using intact polypeptides or using 
5 fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
10 the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 

15 on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

20 phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 

25 nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 

30 refers to the capability of the natural, recombinant, or synthetic INTRA, or of any oligopeptide 

thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
35 3-TCA-5'. 
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A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide 
or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding INTRA or fragments of INTRA may be 

5 employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl 
sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 

10 DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 

15 produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 

20 as conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 


25 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 


30 


His 


Asn, Arg, Gin, Glu 




Ee 


Leu, Val 




Leu 


lie, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, He 


35 


Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Tip 


40 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
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backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
5 absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a 
polypeptide which retains at least one biological or immunological function of the natural molecule. 
10 A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that 
retains at least one biological or immunological function of the polypeptide from which it was 
derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

15 A "fragment" is a unique portion of INTRA or the polynucleotide encoding INTRA which is 

identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, 
a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 

20 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 

25 specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:53-104 comprises a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:53-104, for example, as distinct from any other sequence in 
the genome from which the fragment was obtained. A fragment of SEQ ID NO:53-104 is useful, for 
30 example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:53-104 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:53-104 and the region of SEQ ID NO:53-104 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 
A fragment of SEQ ID NO: 1-52 is encoded by a fragment of SEQ ID NO:53-104. A 
35 fragment of SEQ ID NO: 1-52 comprises a region of unique amino acid sequence that specifically 
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identifies SEQ ID NO: 1-52. For example, a fragment of SEQ ID NO: 1-52 is useful as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ED NO: 1-52. 
The precise length of a fragment of SEQ ID NO: 1-52 and the region of SEQ ID NO: 1-52 to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the 
5 intended purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g.. methionine) followed by an open reading frame and a translation termination codon. A 
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 

10 two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 

15 therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 

20 Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity** between aligned polynucleotide sequences. 

25 Alternatively, a suite of commonly used and freely available sequence comparison algorithms 

is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available 
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 

30 analysis programs including "blastn." that is used to align a known polynucleotide sequence with 

other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bI2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 

35 programs are commonly used with gap and other parameters set to default settings. For example, to 
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compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 
5 Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 
10 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 

15 nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 

supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 

20 in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 

25 alignment methods take into account conservative amino acid substitutions. Such conservative 

substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e 

30 sequence alignment program (described and referenced above). For pairwise alignments of 

polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

35 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
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comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-2 1-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
5 Open Gap: 11 and Extension Gap: 1 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 3 

Filter: on 

10 Percent identity may be measured over the length of an entire defined polypeptide sequence, 

for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 

15 length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

20 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

25 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 

30 conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 Mg/ml sheared, denatured salmon sperm DNA. 

35 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
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under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 

5 conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 
1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 

10 for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /xg/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 

15 circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
The term "hybridization complex" refers to a complex formed between two nucleic acid 

20 sequences by virtue of the formation of hydrogen bonds between complementary bases. A 

hybridization complex may be formed in solution (e.g., Qt or Rot analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

25 The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 

sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 

30 cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of INTRA which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of INTRA which is useful in any of the antibody production methods disclosed herein or known in 

35 the art. 
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The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
5 The term "modulate" refers to a change in the activity of INTRA. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of INTRA. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
10 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
15 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
20 PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an INTRA may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
25 vary by cell type depending on the enzymatic milieu of INTRA. 

"Probe" refers to nucleic acid sequences encoding INTRA, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
30 "Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
35 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
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be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

5 Methods for preparing and using probes and primers are described in the references, for 

example Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Current Protocols in M olecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et aL, 1990, PCR 
Protocols. A Guide to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs 

10 can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

15 100 nucleotides each, and for the analysis .of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 

20 megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 

25 programs may also be obtained from their respective sources and modified to meet the user's specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 

30 unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 

polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

35 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
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that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
5 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

10 vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 

15 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

20 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

25 acids encoding INTRA, or fragments thereof, or INTRA itself, may comprise a bodily fluid; an 

extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

30 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

35 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
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removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution'' refers to the replacement of one or more amino acid residues or nucleotides 

5 by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

10 A "transcript image" refers to the collective pattern of gene expression by a particular cell 

type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 

15 sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 

20 as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 
A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 

25 of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, 'fungi, plants, and animals. The isolated DNA of the present invention can be 

30 introduced into the host by methods known in the art, for example infection, transfection, 

transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
35 at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 



24 



WO 00/77040 PCT/USOO/16636 

the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 

5 an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 
have significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 

10 another. The resulting polypeptides generally will have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 

15 propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 

20 least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 
THE INVENTION 

The invention is based on the discovery of new human intracellular signaling molecules 
(INTRA), the polynucleotides encoding INTRA, and the use of these compositions for the diagnosis, 

25 treatment, or prevention of cell proliferative, autoimmune/inflammatory, neurological, 
gastrointestinal, reproductive, and developmental disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
INTRA. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte 

30 clones in which nucleic acids encoding each INTRA were identified, and column 4 shows the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones and their 
corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries. In some cases, GenBank sequence identifiers are also shown in column 5. 
The Incyte clones and GenBank cDNA sequences, where indicated, in column 5 were used to 

35 assemble the consensus nucleotide sequence of each INTRA and are useful as fragments in 

25 
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hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 

5 sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis along with relevant citations, all of 
which are expressly incorporated by reference herein in their entirety; and column 7 shows analytical 
methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 

10 protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding INTRA. The first column of Table 3 lists the 
nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These 
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 

15 NO:53-104 and to distinguish between SEQ ID NO:53-104 and related polynucleotide sequences. 
The polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. 
Column 3 lists tissue categories which express INTRA as a fraction of total tissues expressing 
INTRA. Column 4 lists diseases, disorders, or conditions associated with those tissues expressing 
INTRA as a fraction of total tissues expressing INTRA. Column 5 lists the vectors used to subclone 

20 each cDNA library. Of particular interest is the expression of SEQ ID NO: 88 and SEQ ID NO:94 in 
reproductive tissues, of SEQ ID NO:99, SEQ ID NO: 100, and SEQ ID NO: 103 in 
hematopoietic/immune tissues, and of SEQ ID NO:96 in cardiovascular tissues. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding INTRA were isolated. Column 1 references the nucleotide SEQ 

25 ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 

shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ED NO:58 maps to chromosome 7 within the interval from 84.40 to 90.30 centiMorgans. 
This interval also contains an EST with high similarity to thyroid disease hypothetical autoantigen. 
SEQ ID NO:67 maps to chromosome 16 within the interval from 1 19.20 centiMorgans to q-terminus. 

30 This interval also contains the paraplegin gene, mutations in which cause spastic paraplegia and 
OXPHOS impairment. SEQ ID NO:70 maps to chromosome 1 1 within the interval from 59.50 to 
62.50 centiMorgans. SEQ ID NO:71 maps to chromosome 7 within the interval from 138.0 to 145.8 
centiMorgans. SEQ ID NO:73 maps to chromosome 12 within the interval from 76.5 to 84.2 
centiMorgans. SEQ ID NO:77 maps to chromosome 7 within the interval from 4.8 to 10.6 

35 centiMorgans and to chromosome 4 within the interval from 56.7 to 60.5 centiMorgans. The interval 
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on chromosome 7 from from 4.8 to 10.6 centiMorgans also contains a gene associated with cell 
proliferation. The interval on chromosome 4 from 56.7 to 60.5 centiMorgans also contains a gene 
associated with cell proliferation. SEQ ID NO:79 maps to chromosome 15 within the interval from 
32.2 to 47. 1 centiMorgans. This interval also contains a gene associated with cell proliferation. SEQ 

5 ID NO:80 maps to chromosome 20 within the interval from 50.2 to 53.6 centiMorgans. This interval 
also contains a gene associated with cell differentiation. SEQ ID NO:84 maps to chromosome 3 
within the interval from 142.2 to 148.7 centiMorgans. SEQ ID NO:87 maps to chromosome 5 within 
the interval from 141.4 to 147.1 centiMorgans. SEQ ID NO:91 maps to chromosome 12 within the 
interval from 62.7 to 67.3 centiMorgans. SEQ ID NO:95 maps to chromosome 15 within the interval 

10 from 45.5 to 58.8 centiMorgans. SEQ ID NO:97 maps to the X chromosome within the interval from 
1 12.8 to 139.4 centiMorgans. 

The invention also encompasses INTRA variants. A preferred INTRA variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the INTRA amino acid sequence, and which contains at least one functional or 

15 structural characteristic of INTRA. 

The invention also encompasses polynucleotides which encode INTRA. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:53-104, which encodes INTRA. The polynucleotide 
sequences of SEQ ID NO:53-104, as presented in the Sequence Listing, embrace the equivalent RNA 

20 sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding INTRA. In 
particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at 
least about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 

25 sequence encoding INTRA. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 
NO:53-104 which has at least about 80%, or alternatively at least about 90%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:53-104. Any one of the polynucleotide variants described above can encode an amino 

30 acid sequence which contains at least one functional or structural characteristic of INTRA. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding INTRA, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 

35 sequence that could be made by selecting combinations based on possible codon choices. These 
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combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring INTRA, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode INTRA and its variants are generally capable 

5 of hybridizing to the nucleotide sequence of the naturally occurring INTRA under appropriately 

selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
INTRA or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 

10 which particular codons are utilized by the host. Other reasons for substantially altering the 

nucleotide sequence encoding INTRA and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode INTRA and 

15 INTRA derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding INTRA or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 

20 hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:53-104 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-51 1.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

25 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 
Biosy stems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found 

30 in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, 
sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer 
system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
CATALYST 800 thermal cycler (PE Biosystems). Sequencing is then carried out using either the 
ABI 373 or 377 DNA sequencing system (PE Biosystems), the MEG AB ACE 1000 DNA sequencing 

35 system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
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sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., 
Ausubel, F.M. (1997) Short Protocols in Molecular Biology , John Wiley & Sons, New York NY, unit 
7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology , Wiley VCH, New York NY, pp. 
856-853.) 

5 The nucleic acid sequences encoding INTRA may be extended utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 

10 Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments 
adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, 

15 M. et al. (1991) PCR Methods Applic. 1: 1 1 1-1 19.) In this method, multiple restriction enzyme 

digestions and ligations may be used to insert an engineered double-stranded sequence into a region 
of unknown sequence before performing PCR. Other methods which may be used to retrieve 
unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 
19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 

20 (Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed 
using commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 

25 about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 

30 into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

35 emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
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software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 
5 In another embodiment of the invention, polynucleotide sequences or fragments thereof 

which encode INTRA may be cloned in recombinant DNA molecules that direct expression of 
INTRA, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express INTRA. 

10 The nucleotide sequences of the present invention can be engineered using methods generally 

known in the art in order to alter INTRA-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 

15 mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, RC. et al. (1999) Nat. 

20 Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 

improve the biological properties of INTRA, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 

25 properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 

30 homologous genes in the same gene family, either from the same or different species, thereby 

maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding INTRA may be synthesized, in whole or in pan, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
35 Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
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Alternatively, INTRA itself or a fragment thereof may be synthesized using chemical methods. For 
example, peptide synthesis can be performed using various solution-phase or solid-phase techniques. 
(See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular Properties , WH Freeman, New 
York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis 
5 may be achieved using the ABI 431 A peptide synthesizer (PE Biosystems). Additionally, the amino 
acid sequence of INTRA, or any part thereof, may be altered during direct synthesis and/or combined 
with sequences from other proteins, or any part thereof, to produce a variant polypeptide or a 
polypeptide having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

10 chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active INTRA, the nucleotide sequences encoding INTRA 
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 

15 contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5 'and 3' untranslated regions in the vector and in 
polynucleotide sequences encoding INTRA. Such elements may vary in their strength and specificity. 
Specific initiation signals may also be used to achieve more efficient translation of sequences 

20 encoding INTRA. Such signals include the ATG initiation codon and adjacent sequences, e.g. the 
Kozak sequence. In cases where sequences encoding INTRA and its initiation codon and upstream 
regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional 
or translational control signals may be needed. However, in cases where only coding sequence, or a 
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG 

25 initiation codon should be provided by the vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, 
e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20: 125-162.) 

Methods- which are well known to those skilled in the art may be used to construct expression 

30 vectors containing sequences encoding INTRA and appropriate transcriptional and translational 

control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A 
Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
al. (1995) Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 

35 16.) 
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A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding INTRA. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
5 plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, 
or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra : Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. 
10 Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 
15 JJ. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, 

adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et ah, (1993) Proc. Natl. Acad. Sci. 
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
20 (1994) Mol. Immunol. 3 1(3):2 19-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding INTRA. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding INTRA can be achieved using a 
25 multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT 1 
plasmid (Life Technologies). Ligation of sequences encoding INTRA into the vector's multiple 
cloning site disrupts the lacL gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
30 nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of INTRA are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of INTRA may be used. For example, vectors 
containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of INTRA. A number of vectors 
35 containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
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promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra ; Bitter, supra ; and Scorer, supra .) 

5 Plant systems may also be used for expression of INTRA. Transcription of sequences 

encoding INTRA may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, supra ; Broglie, supra ; and Winter, supra .) These 

10 constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding INTRA may be ligated into 

15 an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses INTRA in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 

20 based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 

25 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 
of INTRA in cell lines is preferred. For example, sequences encoding INTRA can be transformed 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
30 Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 

enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 
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Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk' and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 

5 or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and par 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 

10 alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), 8 glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 

15 (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding INTRA is inserted within a marker gene sequence, transformed cells containing 
sequences encoding INTRA can be identified by the absence of marker gene function. Alternatively, 

20 a marker gene can be placed in tandem with a sequence encoding INTRA under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding INTRA and that express 
INTRA may be identified by a variety of procedures known to those of skill in the art. These 

25 procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 

amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of INTRA using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 

30 include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 

fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on INTRA is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, 

35 Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and 
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Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunoche mical Protocols, Humana 
Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 

5 hybridization or PCR probes for detecting sequences related to polynucleotides encoding INTRA 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding INTRA, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 

10 such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

15 Host cells transformed with nucleotide sequences encoding INTRA may be cultured under 

conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode INTRA may be designed to contain signal sequences which 

20 direct secretion of INTRA through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 

25 "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

30 In another embodiment of the invention, natural, modified, or recombinant nucleic acid 

sequences encoding INTRA may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric INTRA protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of INTRA activity. Heterologous protein 

35 and peptide moieties may also facilitate purification of fusion proteins using commercially available 

35 
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affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
5 metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the INTRA encoding sequence and the heterologous protein 
sequence, so that INTRA may be cleaved away from the heterologous moiety following purification. 
10 Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra , ch. 10). 
A variety of commercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled INTRA may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 

15 systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

INTRA of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to INTRA. At least one and up to a plurality of test compounds may be 

20 screened for specific binding to INTRA. Examples of test compounds include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
INTRA, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 

25 Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which INTRA 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express INTRA, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila . or 

30 E. coli . Cells expressing INTRA or cell membrane fractions which contain INTRA are then 

contacted with a test compound and binding, stimulation, or inhibition of activity of either INTRA or 
the compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 

35 the assay may comprise the steps of combining at least one test compound with INTRA, either in 
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solution or affixed to a solid support, and detecting the binding of INTRA to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
5 solid support. 

INTRA of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of INTRA. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
INTRA activity, wherein INTRA is combined with at least one test compound, and the activity of 

10 INTRA in the presence of a test compound is compared with the activity of INTRA in the absence of 
the test compound. A change in the activity of INTRA in the presence of the test compound is 
indicative of a compound that modulates the activity of INTRA. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising INTRA under conditions suitable for 
INTRA activity, and the assay is performed. In either of these assays, a test compound which 

15 modulates the activity of INTRA may do so indirectly and need not come in direct contact with the 
test compound. At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding INTRA or their mammalian homologs 
may be "knocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of animal 

20 models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, 
M.R. (1989) Science 244: 1288-1292). The vector integrates into the corresponding region of the host 

25 genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 

30 to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 

heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding INTRA may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 

35 lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
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into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding INTRA can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
5 region of a polynucleotide encoding INTRA is injected into animal ES cells, and the injected 

sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress INTRA, e.g., by secreting INTRA in its milk, may 
10 also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55- 
74). 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of INTRA and intracellular signaling molecules. In addition, the expression of 

15 INTRA is closely associated with cancers of the hematopoetic/immune, nervous, gastrointestinal, and 
reproductive, systems therefore, INTRA appears to play a role in cell proliferative, 
autoimmune/inflammatory, neurological, gastrointestinal, reproductive, and developmental disorders. 
In the treatment of disorders associated with increased INTRA expression or activity, it is desirable to 
decrease the expression or activity of INTRA. In the treatment of disorders associated with decreased 

20 INTRA expression or activity, it is desirable to increase the expression or activity of INTRA. 

Therefore, in one embodiment, INTRA or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of INTRA. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 

25 connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 

polycythemia vera, psoriasis, primary thrombocythemia, hematopoietic cancer including lymphoma, 
leukemia, and myeloma; and other cancers including adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, teratocarcinoma, adenoma, carcinoma and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 

30 gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an autoimmune/inflammatory 
disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 

35 candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's 
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disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia 
with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
5 pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma ; and a gastrointestinal 

10 disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal 
carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, 
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 

15 passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic 
obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha,- 

20 antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and a hepatic tumor including a nodular hyperplasia,. a neurological disorder such as 
epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 

25 disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 

amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 

30 including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system, cerebral palsy, 
neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 

35 diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system 
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disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, 
myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic 
disorders, akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid 
psychoses, postherpetic neuralgia, and Tourette's disorder; and a gastrointestinal disorder such as 
5 esophagitis, esophageal carcinoma, gastritis, gastric carcinoma, inflammatory bowel disease, 

cholecystitis, infections of the intestinal tract, pancreatitis, pancreatic carcinoma, cirrhosis, hepatitis, 
hepatoma, colitis, colonic carcinoma, and Crohn's disease. 

In another embodiment, a vector capable of expressing INTRA or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

10 expression or activity of INTRA including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
INTRA in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of INTRA including, but not 
limited to, those provided above. 

15 In still another embodiment, an agonist which modulates the activity of INTRA may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of INTRA including, but not limited to, those listed above. 

In a further embodiment, an antagonist of INTRA may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of INTRA. Examples of such 

20 disorders include, but are not limited to, those cell proliferative, autoimmune/inflammatory, 

neurological, gastrointestinal, reproductive, and developmental disorders described above. In one 
aspect, an antibody which specifically binds INTRA may be used directly as an antagonist or 
indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues 
which express INTRA. 

25 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding INTRA may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of INTRA including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 

30 therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 
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An antagonist of INTRA may be produced using methods which are generally known in the 
art. In particular, purified INTRA may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind INTRA. Antibodies to INTRA may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
5 not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with INTRA or with any fragment or oligopeptide thereof 

10 which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

15 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 

INTRA have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of INTRA amino acids may be fused with those of another protein, such as KLH, and antibodies to 

20 the chimeric molecule may be produced. 

Monoclonal antibodies to INTRA may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

25 Immunol. Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 

30 Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce INTRA-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 

35 Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 
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Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 
5 Antibody fragments which contain specific binding sites for INTRA may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab^)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

10 et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between INTRA and its 

15 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering INTRA epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra ). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for INTRA. Affinity is expressed as an 

20 association constant, K a , which is defined as the molar concentration of INTRA-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The K a determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple INTRA epitopes, represents the average affinity, or avidity, of the antibodies 
for INTRA. The K a determined for a preparation of monoclonal antibodies, which are monospecific 

25 for a particular INTRA epitope, represents a true measure of affinity. High-affinity antibody 

preparations with K a ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in 
which the INTRA-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
preparations with K a ranging from about 10 6 to 10 7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of INTRA, 

30 preferably in active form, from the antibody (Catty, D. (1988) Antibodies. Volume I: A Practical 
Approach , JRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 

35 example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
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preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of INTRA-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
guidelines for antibody quality and usage in various applications, are generally available. (See, e.g., 
Catty, supra , and Coligan et al., supra .) 
5 In another embodiment of the invention, the polynucleotides encoding INTRA, or any 

fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications 
of gene expression can be achieved by designing complementary sequences or antisense molecules 
(DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene 
encoding INTRA. Such technology is well known in the art, and antisense oligonucleotides or larger 
10 fragments can be designed from various locations along the coding or control regions of sequences 
encoding INTRA. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics . Humana Press Inc., 
Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 

15 intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 

20 76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(11): 1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

25 In another embodiment of the invention, polynucleotides encoding INTRA may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCED)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 

30 (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470^75), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G, et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 

35 express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
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cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 

5 brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanoso ma cruzi). In the 
case where a genetic deficiency in INTRA expression or regulation causes disease, the expression of 
INTRA from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

10 INTRA are treated by constructing mammalian expression vectors encoding INTRA and introducing 
these vectors by mechanical means into INTRA-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 

15 Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of INTRA include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 

20 PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). INTRA may be expressed 
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. U.S.A. 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 

25 Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/raparnycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding INTRA from a normal individual. 

30 Commercially available liposome transformation kits (e.g., the PERFECT LIPID 

TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A. J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
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(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to INTRA expression are treated by constructing a retrovirus vector consisting of (i) the 

5 polynucleotide encoding INTRA under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 

10 Natl. Acad. Sci. U.S.A. 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 

15 et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 

20 the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
25 polynucleotides encoding INTRA to cells which have one or more genetic abnormalities with respect 
to the expression of INTRA. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
30 described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) 
Annu. Rev. Nutr. 19:511-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
35 polynucleotides encoding INTRA to target cells which have one or more genetic abnormalities with 
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respect to the expression of INTRA. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing INTRA to cells of the central nervous system, for which HSV has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer*'), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Goins, W.R et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding INTRA to target cells. The biology of the prototypic alphavirus, 
Semiiki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464^69). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for INTRA into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
INTRA-coding RNAs and the synthesis of high levels of INTRA in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of INTRA into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
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transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 

5 inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. 
and B.I. Carr, Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163- 

10 177.) A complementary sequence or antisense molecule may also be designed to block translation of 
mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 

15 engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding INTRA. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 

20 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 

25 by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding INTRA. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 

30 constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase 

35 linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
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and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

5 An additional embodiment of the invention encompasses a method for screening for a 

compound which is effective in altering expression of a polynucleotide encoding INTRA. 
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
10 macromolecular chemical entities which are capable of interacting with specific polynucleotide 

sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
INTRA expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding INTRA may be therapeutically useful, and in the treament of disorders 

15 associated with decreased INTRA expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding INTRA may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 

20 altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding INTRA is exposed to at least one test compound thus obtained. The sample 

25 may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding INTRA are assayed 
by any method commonly known in the art. Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding INTRA. The amount of hybridization may be quantified, thus 

30 forming the basis for a comparison of the expression of the polynucleotide both with and without 

exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomvces pombe gene expression 

35 system (Atkins, D. et al. (1999) U.S. Patent No. 5.932.435; Arndt, G.M. et al. (2000) Nucleic Acids 
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Res. 28:EI5) or a human cell line such as HeLa cell (Clarke. ML. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
5 sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
10 Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnoi. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
15 monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutically 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest 

20 edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such 

pharmaceutical compositions may consist of INTRA, antibodies to INTRA, and mimetics, agonists, 
antagonists, or inhibitors of INTRA. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

25 intramedullary, intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 

30 delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. 
larger peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 

35 enhancers. 
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Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
5 delivery of macromolecules comprising INTRA or fragments thereof. For example, liposome 

preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, INTRA or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 
found to transduce into the cells of all tissues, including the brain, in a mouse model system 

10 (Schwarze, S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 

15 routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
INTRA or fragments thereof, antibodies of INTRA, and agonists, antagonists or inhibitors of INTRA, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined 
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 

20 calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD^ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD 50 fED 50 ratio. Pharmaceutical compositions 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 
animal studies are used to formulate a range of dosage for human use. The dosage contained in such 

25 compositions is preferably within a range of circulating concentrations that includes the ED^ with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 

30 active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half-life and clearance rate of the particular 

35 formulation. 
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Normal dosage amounts may vary from about 0.1 Mg to 100,000 jug, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
5 inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind INTRA may be used for the 
diagnosis of disorders characterized by expression of INTRA, or in assays to monitor patients being 

10 treated with INTRA or agonists, antagonists, or inhibitors of INTRA. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for INTRA include methods which utilize the antibody and a label to detect INTRA 
in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by covalent or non-covalent attachment of a reporter molecule. A 

15 wide variety of reporter molecules, several of which are described above, are known in the art and 
may be used. 

A variety of protocols for measuring INTRA, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of INTRA expression. Normal 
or standard values for INTRA expression are established by combining body fluids or cell extracts 

20 taken from normal mammalian subjects, for example, human subjects, with antibody to INTRA under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of INTRA expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

25 In another embodiment of the invention, the polynucleotides encoding INTRA may be used 

for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of INTRA may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 

30 expression of INTRA, and to monitor regulation of INTRA levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding INTRA or closely related molecules may be used 
to identify nucleic acid sequences which encode INTRA. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 

35 conserved motif, and the stringency of the hybridization or amplification will determine whether the 
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probe identifies only naturally occurring sequences encoding INTRA, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the INTRA encoding sequences. The hybridization probes of the subject 
5 invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO: 53-104 or 
from genomic sequences including promoters, enhancers, and introns of the INTRA gene. 

Means for producing specific hybridization probes for DNAs encoding INTRA include the 
cloning of polynucleotide sequences encoding INTRA or INTRA derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 

10 be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 

polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding INTRA may be used for the diagnosis of disorders 

15 associated with expression of INTRA. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, hematopoietic cancer 
including lymphoma, leukemia, and myeloma; and other cancers including adenocarcinoma, 

20 leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, adenoma, carcinoma and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus ; an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 

25 disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 

30 gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 

35 colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
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circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma ; and a 
gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal 
stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, 
emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
5 obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic 
obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 

10 hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 

encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha r 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 

15 pregnancy, and a hepatic tumor including a nodular hyperplasia,^ neurological disorder such as 

epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 

20 and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 

25 retardation and other developmental disorders of the central nervous system, cerebral palsy, 

neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system 
disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, 
myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic 

30 disorders, akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid 
psychoses, postherpetic neuralgia, and Tourette's disorder; and a gastrointestinal disorder such as 
esophagitis, esophageal carcinoma, gastritis, gastric carcinoma, inflammatory bowel disease, 
cholecystitis, infections of the intestinal tract, pancreatitis, pancreatic carcinoma, cirrhosis, hepatitis, 
hepatoma, colitis, colonic carcinoma, and Crohn's disease. The polynucleotide sequences encoding 

35 INTRA may be used in Southern or northern analysis, dot blot, or other membrane-based 
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technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like assays; and in 
microarrays utilizing fluids or tissues from patients to detect altered INTRA expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding INTRA may be useful in assays that 
5 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding INTRA may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to 

10 a control sample then the presence of altered levels of nucleotide sequences encoding INTRA in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 

15 INTRA, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding INTRA, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially purified 

20 polynucleotide is used. Standard values obtained in this manner may be compared with values 

obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 

25 patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
30 development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
35 INTRA may involve the use of PCR. These oligomers may be chemically synthesized, generated 
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enzymaticaliy, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding INTRA, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
INTRA, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 

5 quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding INTRA may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 

10 conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 

oligonucleotide primers derived from the polynucleotide sequences encoding INTRA are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in single-stranded form, and 

15 these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 

oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequence of individual overlapping DNA fragments which assemble into a common consensus 

20 sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of INTRA include radiolabeling 

25 or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 

30 rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript 

35 Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
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used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 
monitor the activities of therapeutic agents in the treatment of disease. In particular, this information 
5 may be used to develop a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

In another embodiment, antibodies specific for INTRA, or INTRA or fragments thereof may 

10 be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 

15 (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are 
well known and thoroughly described in DNA Microarrays: A Practical Approach , M. Schena, ed. 
(1999) Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding INTRA may be 

20 used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 

25 region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 

chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, JJ. 
et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. 
(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be 

30 used to develop genetic linkage maps, for example, which correlate the inheritance of a disease state 
with the inheritance of a particular chromosome region or restriction fragment length polymorphism 
(RFLP). (See, e.g., Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 

35 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
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(OMIM) World Wide Web site. Correlation between the location of the gene encoding INTRA on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
5 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 

10 localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

15 In another embodiment of the invention, INTRA, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between INTRA and the agent being tested may be measured. 

20 Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with INTRA, or fragments thereof, 
and washed. Bound INTRA is then detected by methods well known in the art. Purified INTRA can 

25 also be coated directly onto plates for use in the aforementioned drug screening techniques. 

Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding INTRA specifically compete with a test compound for binding INTRA. 
30 In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with INTRA. 

In additional embodiments, the nucleotide sequences which encode INTRA may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
35 properties as the triplet genetic code and specific base pair interactions. 



57 



WO 00/77040 PCT/USOO/16636 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 
5 The disclosures of all patents, applications, and publications mentioned above and below, in 

particular U.S. Ser. No. 60/139,566 (filing date 16 June 1999), U.S. Ser. No. 60/149,640 (filing date 
17 August 1999), and U.S. Ser. No. 60/164,417 (filing date 9 November 1999), are hereby expressly 
incorporated by reference. 



10 EXAMPLES 
I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 

15 monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 

20 using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

25 libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 

30 appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid 

35 (Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
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plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 

II, Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
5 excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 
10 ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
15 using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 

20 such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 
Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 

25 separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 
or 377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base 
calling software; or other sequence analysis systems known in the art. Reading frames within the 
cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 

30 7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
35 descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 
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programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
5 greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
10 sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 

15 and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 

annotation using programs based on BLAST, FAST A, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 

20 amino acid sequences, and these full length sequences were subsequently analyzed by querying 

against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) 

25 The programs described above for the assembly and analysis of full length polynucleotide 

and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:53-104. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

30 Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 
1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 

35 molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
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much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

5 5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 

10 product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 

15 BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

20 The results of northern analyses are reported as a percentage distribution of libraries in which 

the transcript encoding INTRA occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 

25 cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
Table 3. 

V. Chromosomal Mapping of ABBR Encoding Polynucleotides 

30 The cDNA sequences which were used to assemble SEQ ID NO: 8- 14 were compared with 

sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:8-14 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 

35 from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
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Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:8-14 [fill in the specific SEQ ID NOs if not all of 
the sequences have been mapped] are described in The Invention as ranges, or intervals, of human 
chromosomes. [Include the following sentence if any of your sequences have more than one map 
location.] More than one map location is reported for SEQ ID NO:8-14 [fill in specific SEQ ID 
NO:s], indicating that previously mapped sequences having similarity, but not complete identity, to 
SEQ ID NO:8-14 [fill in specific SEQ ID NO:s] were assembled into their respective clusters. The 
map position of an interval, in centiMorgans, is measured relative to the terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by Genethon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web 
site (http://www.ncbi.nlm.nih.gov/genemap/ ). can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of INTRA Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:53-104 were produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5 'extension of the known fragment, and the other 
primer, to initiate 3 'extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and (3-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C 1 min; Step 4: 68°C, 
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2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94 °C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68°C 5 min; Step 7: storage at 4°C. 
5 The concentration of DNA in each well was determined by dispensing 100 fil PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 /xl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
10 concentration of DNA. A 5 y\ to 10 (A aliquot of the reaction mixture was analyzed by 

electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 

15 sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfii DNA polymerase (Stratagene) to fill-in restriction site 

20 overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 

antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

25 parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted 
with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 

30 primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:53-104 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 

35 VII. Labeling and Use of Individual Hybridization Probes 
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Hybridization probes derived from SEQ ID NO.53-104 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
5 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 piCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane-based 

10 hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl H, Eco RI, Pst I, Xba I, or Pvu H (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 

15 under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

20 photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra) , 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
supra ). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 

25 elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 

procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

30 Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 

may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can 
be selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 

35 After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
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fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
5 described in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A)" RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg//xl oligo-(dT) primer (21mer), IX 
10 first strand buffer, 0.03 units//*! RNase inhibitor, 500 /xM dATP, 500 jjlM dGTP, 500 fiM dTTP, 40 
fiM dCTP, 40 fiM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
15 with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 

incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
20 then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 fil 5X SSC/0.2% SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element 
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
25 uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
/xg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
30 slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 
110°Coven. 
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Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 /il of the array element DNA, at an average 
concentration of 100 ng//xl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 
5 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 
0.2% SDS and distilled water as before. 
10 Hybridization 

Hybridization reactions contain 9 fi\ of sample mixture consisting of 0.2 /zg each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microarray surface and covered 
with an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just 

15 slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 fi\ of 5X SSC in a corner of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60 °C. The arrays are washed for 10 min at 45 °C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45 °C in a second wash buffer (0.1X 
SSC), and dried. 

20 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

25 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

30 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
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The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1: 100,000. When two samples 

5 from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

10 (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 

15 emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

20 IX. Complementary Polynucleotides 

Sequences complementary to the INTRA-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring INTRA. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 

25 designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of INTRA. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5 'sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the INTRA-encoding 
transcript. 

30 X- Expression of INTRA 

Expression and purification of INTRA is achieved using bacterial or virus-based expression 
systems. For expression of INTRA in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 

35 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
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element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express INTRA upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of INTRA in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
5 (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding INTRA by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
10 Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, INTRA is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

15 affinity-based purification of recombinant fusion protein from crude cell ly sates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
INTRA at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 

20 purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10 and 16). Purified INTRA obtained by these methods can be used directly in the assays 
shown in Examples XI, XII, and XV. 

25 XI. Demonstration of INTRA Activity 

INTRA activity is associated with its ability to form protein-protein complexes and is 
measured by its ability to regulate growth characteristics of NIH3T3 mouse fibroblast cells. A cDNA 
encoding INTRA is subcloned into an appropriate eukaryotic expression vector. This vector is 
transfected into NIH3T3 cells using methods known in the art. Transfected cells are compared with 

30 non-transfected cells for the following quantifiable properties: growth in culture to high density, 
reduced attachment of cells to the substrate, altered cell morphology, and ability to induce tumors 
when injected into immunodeficient mice. The activity of INTRA is proportional to the extent of 
increased growth or frequency of altered cell morphology in NIH3T3 cells transfected with INTRA. 
Alternatively, INTRA activity is measured by binding of INTRA to radiolabeled formin 

35 polypeptides containing the proline-rich region that specifically binds to SH3 containing proteins 
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(Chan, D.C. et al. (1996) EMBO J. 15: 1045-54). Samples of INTRA are run on SDS-PAGE gels, 
and transferred onto nitrocellulose by electroblotting. The blots are blocked for 1 hr at room 
temperature in TBST (137 mM NaCl, 2.7 mM Kcl, 25 mM Tris (pH 8.0) and 0.1% Tween-20) 
containing non-fat dry milk. Blots are then incubated with TBST containing the radioactive formin 
5 polypeptide for 4 hrs to overnight. After washing the blots four times with TBST, the blots are 
exposed to autoradiographic film. Radioactivity is quantitated by cutting out the radioactive spots 
and counting them in a radioisotope counter. The amount of radioactivity recovered is proportional to 
the activity of INTRA in the assay. 

Alternatively, INTRA activity is demonstrated by measuring the binding of INTRA to Ca 2+ 

10 using a Ca 2+ overlay system (Weis, K. et al. (1994) J. Biol. Chem. 269:19142-19150). Purified 
INTRA is transferred and immobilized onto a nitrocellulose membrane. The membrane is washed 
three times with buffer (60 mM KCK 5 mM MgCl 2 , 10 mM imidazole-HCl, pH 6.8) and incubated in 
this buffer for 10 minutes with 1 /xCi [ 45 Ca 2+ ] (NEN-DuPont, Boston, MA). Unbound [ 45 Ca 2+ ] is 
removed from the membrane by washing with water, and the membrane is dried. Membrane-bound 

15 [ 45 Ca 2+ ] is detected by autoradiography and quantified using image analysis systems and software. 
INTRA activity is proportional to the amount of [ 45 Ca 2+ ] detected on the membrane. 

Alternatively, INTRA activity is assayed by measuring the conversion of 3 H-cAMP to 3 H- 
adenosine in the presence of INTRA and 5* nucleotidase. INTRA is added to a solution containing 50 
mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 0.1 unit 5* nucleotidase (from Crotalus atrox venom), and 

20 0.0064-2.0 uM 3 H- cAMP and the reaction is incubated at 37°C for a time period that would yield less 
than 15% cAMP hydrolysis in order to avoid non-linearity associated with product inhibition. 
Soluble radioactivity associated with 3 H-adenosine is quantitated using a Beta scintillation counter. 
The amount of radioactivity recovered is proportional to the activity of INTRA in the reaction. 

25 XII. Functional Assays 

INTRA function is assessed by expressing the sequences encoding INTRA at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 

30 contain the cytomegalovirus promoter. 5-10 fxg of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 //g of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 

35 recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 



69 



WO 00/77040 PCT/US00/16636 

Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events 

5 include changes in nuclear DN A content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
specific antibodies; and alterations in plasma membrane composition as measured by the binding of 

10 fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cytometry , Oxford, New York NY. 

The influence of INTRA on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding INTRA and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 

15 of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding INTRA and other genes of interest can be analyzed by 
northern analysis or microarray techniques. 

20 XIII. Production of INTRA Specific Antibodies 

INTRA substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488^95), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the INTRA amino acid sequence is analyzed using LASERGENE software 

25 (DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 

30 peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 
INTRA activity by, for example, binding the peptide or INTRA to a substrate, blocking with 1% 

35 BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
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XIV. Purification of Naturally Occurring INTRA Using Specific Antibodies 

Naturally occurring or recombinant INTRA is substantially purified by immunoaffmity 
chromatography using antibodies specific for INTRA. An immunoaffmity column is constructed by 
covalently coupling anti-INTRA antibody to an activated chromatographic resin, such as 

5 CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing INTRA are passed over the immunoaffmity column, and the column is 
washed under conditions that allow the preferential absorbance of INTRA (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 

10 antibody/INTRA binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and INTRA is collected. 

XV. Identification of Molecules Which Interact with INTRA 

INTRA, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent. 
(See? e< g M Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 

15 previously arrayed in the wells of a multi-well plate are incubated with the labeled INTRA, washed, 
and any wells with labeled INTRA complex are assayed. Data obtained using different 
concentrations of INTRA are used to calculate values for the number, affinity, and association of 
INTRA with the candidate molecules. 

Alternatively, molecules interacting with INTRA are analyzed using the yeast two-hybrid 

20 system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

INTRA may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 

25 Patent No. 6,057,101). 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 

30 embodiments. Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, 
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, 
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, 
10 SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, 
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, 
SEQ ID NO:47, SEQ ID NO:48, SEQ ED NO:49, SEQ ID NO:50, SEQ ED NO:51, and SEQ ID 
NO:52, 

15 b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 

amino acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID 
NO:3, SEQ ED NO:4, SEQ ED NO:5, SEQ ED NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ED NO:9, 
SEQ ED NO: 10, SEQ ED NO: 1 1, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 14, SEQ ID NO: 15, 
SEQ ED NO:16, SEQ ED NO:17, SEQ ED NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ED NO:21, 

20 SEQ ED NO:22, SEQ ED NO:23, SEQ ED NO:24, SEQ ED NO:25, SEQ ED NO:26, SEQ ED NO:27, 
SEQ ED NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ED NO:31, SEQ ID NO:34, SEQ ED NO:35, 
SEQ ED NO-.36, SEQ ID NO:37, SEQ ED NO:38, SEQ ED NO:39, SEQ ED NO:40, SEQ ID NO:41, 
SEQ ED NO:42, SEQ ID NO:43, SEQ ED NO:44, SEQ ID NO:45, SEQ ED NO:46, SEQ ID NO:47, 
SEQ ED NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ED NO:51, and SEQ ED NO:52, 

25 c) a biologically active fragment of an amino acid sequence selected from the group 

consisting of SEQ ID NO: 1, SEQ ED NO:2, SEQ ED NO:3, SEQ ID NO:4, SEQ ED NO:5, SEQ ED 
NO:6, SEQ ED NO:7, SEQ ID NO:8, SEQ ED NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, 
SEQ ED NO:13, SEQ ED NO:14, SEQ ED NO:15, SEQ ID NO:16, SEQ ED NO:17, SEQ ED NO:18, 
SEQ ED NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ED NO:22, SEQ ED NO:23, SEQ ID NO:24, 

30 SEQ ED NO:25, SEQ ID NO:26, SEQ ED NO:27, SEQ ED NO:28, SEQ ED NO:29, SEQ ID NO:30, 
SEQ ED NO:31, SEQ ID NO:34, SEQ ID NO:35, SEQ ED NO:36, SEQ ED NO:37, SEQ ED NO:38, 
SEQ ED NO:39, SEQ ID NO:40, SEQ ID NO.41, SEQ ID NO:42, SEQ ED NO:43, SEQ ID NO:44, 
SEQ ED NO:45, SEQ ID NO:46, SEQ ED NO:47, SEQ ED NO:48, SEQ ED NO:49, SEQ ID NO:50, 
SEQ ED NO:51, and SEQ ID NO:52, and 
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d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID 
NO: 13. SEQ ID NO:14, SEQ ID NO: 15, SEQ ED NO: 16, SEQ ID NO:17, SEQ ID NO: 18, SEQ ID 
5 NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:25, SEQ ID NO:26, SEQ ED NO:27, SEQ ED NO:28, SEQ ED NO:29, SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:34, SEQ ID NO:35, SEQ ED NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ED 
NO:39, SEQ ED NO:40, SEQ ID NO:41, SEQ ED NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ED 
NO:45, SEQ ED NO:46, SEQ ID NO:47, SEQ ED NO:48, SEQ ID NO:49, SEQ ED NO:50, SEQ ED 
10 NO:5 1, and SEQ ED NO:52. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l, 
SEQ ED NO:2, SEQ ED NO:3, SEQ ID NO:4, SEQ ED NO:5, SEQ ED NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, SEQ ED NO: 10, SEQ ED NO: 1 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ED 

15 NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ED NO: 17, SEQ ED NO: 18, SEQ ED NO: 19, SEQ ED 
NO:20, SEQ ED NO:21, SEQ ID NO:22, SEQ ED NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ED 
NO:26, SEQ ED NO:27, SEQ ID NO:28, SEQ ED NO:29, SEQ ED NO:30, SEQ ED NO:31, SEQ ID 
NO:34, SEQ ID NO:35, SEQ ED NO:36, SEQ ED NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ED 
NO:40, SEQ ID NO:41, SEQ ED NO:42, SEQ ED NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID 

20 NO:46, SEQ ID NO:47, SEQ ED NO:48, SEQ ED NO:49, SEQ ID NO:50, SEQ ED NO:5 1 , and SEQ 
ED NO:52. 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

25 4. An isolated polynucleotide encoding a polypeptide of claim 2. . 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ED 
NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ED NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ED 
NO:59, SEQ ID NO:60, SEQ ED NO:61, SEQ ED NO:62, SEQ ID NO:63, SEQ ED NO:64, SEQ ED 
30 NO:65. SEQ ID NO:66, SEQ ED NO:67, SEQ ED NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ED NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ED 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ED NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID 
NO:83, SEQ ID NO:86, SEQ ID NO:87, SEQ ED NO:88, SEQ ID NO:89, SEQ ED NO:90, SEQ ED 
NO:91, SEQ ID NO:92, SEQ ED NO:93, SEQ ED NO:94, SEQ ID NO:95, SEQ ED NO:96, SEQ ED 
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NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO:101, SEQ ID NO:102, SEQ 
ID NO: 103, and SEQ ID NO: 104. 



6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
5 polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

10 

9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 

15 claim 1, and 

b) recovering the polypeptide so expressed. 



10. An isolated antibody which specifically binds to a polypeptide of claim 1. 



20 11. An isolated polynucleotide comprising a polynucleotide sequence selected from the 

group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:53-104, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:53-104, 

25 c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 



12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
30 polynucleotide of claim 1 1 . 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
35 comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
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specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 
b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

5 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

10 a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 

amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

15 16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 

1 and a pharmaceutically acceptable excipient. 

17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 

20 SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO:10, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID 
NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 

25 NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID 
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, and SEQ ID NO:52. 

18. A method for treating a disease or condition associated with decreased expression of 
30 functional INTRA, comprising administering to a patient in need of such treatment the 

pharmaceutical composition of claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

35 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
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b) detecting agonist activity in the sample. 
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20. A pharmaceutical composition comprising an agonist compound identified by a method 
of claim 19 and a pharmaceutical^ acceptable excipient. 

5 

21. A method for treating a disease or condition associated with decreased expression of 
functional INTRA, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 20. 

10 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound* and 

b) detecting antagonist activity in the sample. 



15 23. A pharmaceutical composition comprising an antagonist compound identified by a 

method of claim 22 and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
INTRA, comprising administering to a patient in need of such treatment a pharmaceutical 
20 composition of claim 23. 



25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
25 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying 
a compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
30 claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 
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c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 

27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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Glu 


Ala 


Ala 


Arg Arg 


Leu 


He 


Glu 






295 










300 



21/82 



WO 00/77040 



PCT/USOO/16636 



350 355 360 



Phe 


Gin 


Glu 


Val 


GlU 
365 


Asn 


Phe 


Phe 


Thr 


Phe 
370 


Leu 


Lys 


Asn 


He 


Asn 
375 


Asp 


Val 


Asp 


Thr 


Ala 


Leu 


Ser 


Phe 


Tyr 


His 


Met 


Ala 


Gly Ala 


Ser 










380 










385 










390 


Leu 


Asp 


Lys 


Val 


Thr 
395 


Met 


Gin 


Gin 


Val 


Ala 
400 


Arg 


Thr 


Val 


Ala 


Lys 
405 


Val 


Glu 


Leu 


Ser 


Asp 
410 


His 


Val 


Cys 


Asp 


Val 
415 


Val 


Phe 


Ala 


Leu 


Phe 
420 


Asp 


Cys 


Asp 


Gly 


Asn 


Gly Glu 


Leu 


Ser 


Asn 


Lys 


Glu 


Phe 


Val 


Ser 










425 










430 










435 


He 


Met 


Lys 


Gin 


Arg 


Leu 


Met 


Arg Gly 


Leu 


Glu 


Lys 


Pro 


Lys 


Asp 










440 










445 










450 


Met 


Gly 


Phe 


Thr 


Arg 
455 


Leu 


Met 


Gin 


Ala 


Met 
460 


Trp 


Lys 


Cys 


Ala 


Gin 
465 


Glu 


Thr 


Ala 


Trp 


Asp 


Phe 


Ala 


Leu 


Pro 


Lys 


Gin 











470 475 
<210> 22 
<211> 171 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No; 16823 2 0CD1 



<400> 22 



Met 


Glu 


Lys 


Arg 


Leu 


Gin 


Glu 


Ala 


Gin 


Leu 


Tyr 


Lys 


Glu Glu Gly 


1 








5 










10 










15 


Asn 


Gin 


Arg 


Tyr 


Arg 


Glu 


Gly 


Lys 


Tyr 


Arg 


Asp 


Ala 


Val 


Ser 


Arg 










20 










25 










30 


Tyr 


His 


Arg 


Ala 


Leu 


Leu 


Gin 


Leu 


Arg 


Gly 


Leu 


Asp 


Pro 


Ser 


Leu 










35 










40 










45 


Pro 


Ser 


Pro 


Leu 


Pro 


Asn 


Leu 


Gly 


Pro 


Gin 


Gly 


Pro 


Ala 


Leu 


Thr 










50 










55 










60 


Pro 


Glu 


Gin 


Glu 


Asn 


He 


Leu 


His 


Thr 


Thr 


Gin 


Thr 


Asp 


Cys 


Tyr 










65 










70 










75 


Asn 


Asn 


Leu 


Ala 


Ala 


Cys 


Leu 


Leu 


Gin 


Met 


Glu 


Pro 


Val 


Asn 


Tyr 










80 










85 










90 


Glu 


Arg 


Val 


Arg 


Glu 


Tyr 


Ser 


Gin 


Lys 


Val 


Leu 


Glu 


Arg 


Gin 


Pro 










95 










100 










105 


Asp 


Asn 


Ala 


Lys 


Ala 


Leu 


Tyr 


Arg 


Ala 


Gly Val 


Ala 


Phe 


Phe 


His 










110 










115 










120 


Leu 


Gin 


Asp 


Tyr 


Asp 


Gin 


Ala 


Arg 


His 


Tyr 


Leu 


Leu 


Ala 


Ala 


Val 










125 










130 










135 


Asn 


Arg 


Gin 


Pro 


Lys 


Asp 


Ala 


Asn 


Val 


Arg 


Arg 


Tyr 


Leu 


Gin 


Leu 










140 










145 










150 


Thr 


Gin 


Ser 


Glu 


Leu 


Ser 


Ser 


Tyr 


His 


Arg 


Lys 


Glu 


Lys 


Gin 


Leu 










155 










160 










165 


Tyr 


Leu 


Gly Met 


Phe 


Gly 




























170 























<210> 23 
<211> 163 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1728263CD1 

<400> 23 

Met Phe Phe Ser Glu Ala Arg Ala 

1 5 
Ser Pro Ser Glu His Arg Lys Trp 
20 

Asp Glu Asp His Lys Gly Tyr Leu 
35 



Arg Ser Arg Thr Trp Glu Ala 
10 15 

Val Glu Val Phe Lys Ala Cys 
25 30 

Ser Arg Glu Asp Phe Lys Thr 
40 45 



23/82 



WO 00/77040 



Ala 


Val 


Val 


Met 


Leu 

D U 


Pne 


Gly Tyr 


Lys 


Asp 


Ser 


Val 


Met 


Ser 


Ser 


lie 


Asn 


Pro 








65 










Leu 


Glu 


Gly Phe 


Leu 
80 


Asn 


lie 


Val 


Arg 


Arg 


Tyr 


Arg 


Asn 


Glu 
95 


Val 


Arg 


His 


He 


Tyr 


Tyr 


Arg 


Gly 


Phe 


Leu 


Thr 


Leu 


Glu 










110 










Arg 


Gin 


Val 


Ala 


Pro 


Lys 


Leu 


Pro 


Glu 










125 










Phe 


Arg 


Glu 


Val 


Asp 


Arg Asp 


Ser Asp 










140 










Asp 


Phe 


Glu 


Tyr 


Ala 


Leu 


Asn 


Tyr 


Gly 










155 











<210> 24 
<211> 354 
<212> PRT 

<213> Homo sapiens 



PCT7US00/16636 



Pro 


Ser 


Lys 


He 


Glu 


Val 


55 








60 


Asn 


Thr 


Ser 


Gly 


He 


Leu 


70 










75 


Lys 


Lys 


Lys 


Glu 


Ala 


Gin 


85 










90 


Phe 


Thr 


Ala 


Phe 


Asp 


Thr 


100 










105 


Asp 


Phe 


Lys 


Lys 


Ala 


Phe 


115 










120 


Arg 


Thr 


Val 


Leu 


Glu 


Val 


130 










135 


Gly 


His 


Val 


Ser 


Phe 


Arg 


145 










150 


Gin 


Lys 


Glu 


Ala 







160 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1867626CD1 



<400> 24 



Met 


Gly 


Glu 


Gin 


Pro 


He 


Phe 


Ser 


Thr 


Arg 


Ala 


His 


Val 


Phe 


Gin 


1 








5 










10 










15 


He 


Asp 


Pro 


Asn 


Thr 


Lys 


Lys 


Asn 


Trp 


Val 


Pro 


Thr 


Ser 


Lys 


His 










20 










25 










30 


Ala 


Val 


Thr 


Val 


Ser 


Tyr 


Phe 


Tyr 


Asp 


Ser 


Thr 


Arg 


Asn 


Val 


Tyr 










35 










40 










45 


Arg 


He 


He 


Ser 


Leu 


Asp 


Gly 


Ser 


Lys 


Ala 


He 


He 


Asn 


Ser 


Thr 










50 










55 










60 


He 


Thr 


Pro 


Asn 


Met 


Thr 


Phe 


Thr 


Lys 


Thr 


Ser 


Gin 


Arg 


Phe 


Gly 










65 










70 








75 


Gin 


Trp 


Ala 


Asp 


Ser 


Arg 


Ala 


Asn 


Thr 


Val 


Tyr 


Gly 


Leu 


Gly 


Phe 










80 










85 










90 


Ser 


Ser 


Glu 


His 


His 


Leu 


Ser 


Lys 


Phe 


Ala 


Glu 


Lys 


Phe 


Gin 


Glu 










95 








100 








105 


Phe 


Lys 


Glu 


Ala 


Ala 


Arg 


Leu 


Ala 


Lys 


Glu 


Lys 


Ser 


Gin 


Glu 


Lys 










110 










115 










120 


Met 


Glu 


Leu 


Thr 


Ser 


Thr 


Pro 


Ser 


Gin 


Glu 


Ser 


Ala 


Gly 


Gly 


Asp 










125 










130 






135 


Leu 


Gin 


Ser 


Pro 


Leu 


Thr 


Pro 


Glu 


Ser 


He 


Asn 


Gly 


Thr 


Asp 


Asp 










140 










145 






150 


Glu 


Arg 


Thr 


Pro 


Asp 


Val 


Thr 


Gin 


Asn 


Ser 


Glu 


Pro 


Arg 


Ala 


Glu 










155 










160 








165 


Pro 


Thr 


Gin 


Asn 


Ala 


Leu 


Pro 


Phe 


Ser 


His 


Ser 


Ser 


Ala 


He 


Ser 










170 










175 










180 


Lys 


His 


Trp 


Glu 


Ala 


Glu 


Leu 


Ala 


Thr 


Leu 


Lys 


Gly 


Asn 


Asn 


Ala 










185 










190 










195 


Lys 


Leu 


Thr 


Ala 


Ala 


Leu 


Leu 


Glu 


Ser 


Thr 


Ala 


Asn 


Val 


Lys 


Gin 










200 










205 








210 


Trp 


Lys 


Gin 


Gin 


Leu 


Ala 


Ala 


Tyr 


Gin 


Glu 


Glu 


Ala 


Glu 


Arg 


Leu 










215 










220 










225 


His 


Lys 


Arg 


Val 


Thr 


Glu 


Leu 


Glu 


Cys 


Val 


Ser 


Ser 


Gin 


Ala 


Asn 










230 










235 










240 


Ala 


Val 


His 


Thr 


His 


Lys 


Thr 


Glu 


Leu 


Asn 


Gin 


Thr 


He 


Gin 


Glu 










245 










250 










255 


Leu 


Glu 


Glu 


Thr 


Leu 


Lys 


Leu 


Lys 


Glu 


Glu 


Glu 


He 


Glu 


Arg 


Leu 










260 










265 










270 


Lys 


Gin 


Glu 


He 


Asp 


Asn 


Ala 


Arg 


Glu 


Leu 


Gin 


Glu 


Gin 


Arg 


Asp 










275 










280 










285 


Ser 


Leu 


Thr 


Gin 


Lys 


Leu 


Gin 


Glu 


Val 


Glu 


He 


Arg 


Asn 


Lys 


Asp 










290 










295 










300 


Leu 


Glu 


Gly 


Gin 


Leu 


Ser 


Asp 


Leu 


Glu 


Gin 


Arg 


Leu 


Glu 


Lys 


Ser 



24/82 
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# 



305 310 315 

Gin Asn Glu Gin Glu Ala Phe Arg Asn Asn Leu Lys Thr Leu Leu 
320 325 330 

Glu lie Leu Asp Gly Lys lie Phe Glu Leu Thr Glu Leu Arg Asp 
335 340 345 

Asn Leu Ala Lys Leu Leu Glu Cys Ser 
350 

<210> 25 
<211> 365 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1990126CD1 



<400> 25 



Met 


Asn 


He 


Met 


Asp 


Phe 


Asn 


Val 


1 








5 








Gly 


Thr 


Phe 


Leu 


Ser 


Arg 


Ala 


Val 








20 








Gly Gin Ala 


Glu 


Lys 


Thr 


Glu 


Leu 










35 








Leu 


Ser 


Lys 


Ala 


Glu 
50 


Pvs 


Thr 


Lys 


Lys 


Gin 


Thr 


Glu 


Val 


Leu 


Leu 


Gin 








65 








G1U 


Glu 


Phe 


Val 


Tvr 
j 

80 


Glu 


Lys 


Leu 


Tl a 

J. J_e 


Asn 


Asn 


Pro 


Glu 
95 


Leu 


Leu 


Glv 


Thr 


Glu 


Phe 


Glv 


Pro 
110 


Glv 


Thr 


Ala 


Cys 


Gly Glu 


Thr 


Gin 


Lys 


Arg 


He 








125 








Tip 


Gin 


Thr 


Ser 


Ala 
140 


Leu 


Asn 


Phe 


He 


Glu 


Gly 


ASD 


Tvr 

* Jr ■** 

155 


Lvs 


Thr 


He 


Gin 


Asn 


Lys 


Arcr 


Leu 
170 


Asp 


Leu 


Asp 


Lys 


Ala 


Lys 


Ala 


Ala 


Glu 


Thr 


Arg 








185 








Arg 


He 


Thr 


Gin 


Ser 


Glu 


Phe 


Asp 








200 








Leu 


Leu 


Leu 


Glu 


Gly 
215 


He 


Ser 


Ser 


Cys 


Leu 


Asn 


Asp 


Phe 


Val 


Glu 


Ala 








230 








Cys 


Tyr 


Gin 


Tyr 


Met 


Leu 


Asp 


Leu 








245 








Pro 


Ser 


Asn 


Tyr 


Leu 
260 


Ser 


Asn 


Asn 


Val 


Pro 


Ser 


Val 


Leu 
275 


Pro 


Asn 


Ala 


Ser 


Thr 


Ser 


Gly 


Leu 


Val 


He 


Thr 








290 








Leu 


Lys 


Glu 


Cys 


Ser 


Gly 


Ser 


Arg 








305 








Tyr 


Asp 


Ala 


Ala 


Asn 


Ser 


Thr 


Glu 








320 








Val 


He 


Thr 


Val 


Phe 
335 


Ser 


Val 


Val 


Met 


Gly Glu 


Arg 


Gly Asn 


Gin 


Lys 










350 








Leu 


Glu 


Leu 


Leu 


Asn 
365 









Lys 


Lys 


Leu 


Ala 


Ala 


Asp 


Ala 


10 










15 


Gin 


Phe 


Thr 


Glu 


Glu 


Lys 


Leu 




25 










30 


Asp 


Ala 


His 


Leu 


Glu 


Asn 


Leu 


40 










45 


He 


Trp 


Thr 


Glu 


Lvs 


He 


Met 




55 










60 


Pro 


Asn 


Pro 


Asn 


Ala 


Arg 


He 




70 










75 


Asp 


Arg 


Lys 


Ala 


Pro 


Ser 


Arg 




85 










90 


Gin 


Tyr 


Met 


He 


Asp 


Ala 


Gly 




100 










105 


Tyr Gly Asn Ala 


Leu 


He 


Lys 




115 










120 


Gly 


Thr 


Ala 


Asp 


Arg 


Glu 


Leu 


130 










135 


Leu 


Thr 


Pro 


Leu 


Arg 


Asn 


Phe 




145 










150 


Ala 


Lys 


Glu 


Arg 


Lys 


Leu 


Leu 




160 










165 


Ala 


Ala 


Lys 


Thr 


Arg 


Leu 


Lys 




175 










180 


Asn 


Ser 


Ser 


Glu 


Gin 


Glu 


Leu 




190 










195 


Arg 


Gin 


Ala 


Glu 


He 


Thr 


Arg 




205 










210 


Thr 


His 


Ala 


His 


His 


Leu 


Arg 




220 










225 


Gin 


Met 


Thr 


Tyr 


Tyr 


Ala 


Gin 




235 










240 


Gin 


Lys 


Gin 


Leu 


Gly 


Ser 


Phe 




250 










255 


Asn 


Gin 


Thr 


Ser 


Val 


Thr 


Pro 




265 










270 


He 


Gly 


Ser 


Ser 


Ala 


Met 


Ala 




280 










285 


Ser 


Pro 


Ser 


Asn 


Leu 


Ser 


Asp 




295 










300 


Lys 


Ala 


Arg 


Val 


Leu 


Tyr 


Asp 


310 










315 


Leu 


Ser 


Leu 


Leu 


Ala 


Asp 


Glu 




325 










330 


Gly Met 


Asp 


Ser 


Asp 


Trp 


Leu 




340 










345 


Gly 


Lys 


Val 


Pro 


He 


Thr 


Tyr 


355 










360 



25/82 
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<210> 26 












<211> 274 












<212> PRT 












<213> Homo sapiens 










<220> 












<221> misc_f eature 










<223 > Incyte ID 


No: 


2104180CD1 




<400> 26 












Met Ala Thr Thr 


Val 


Ser 


Thr 


Gin Arg 


1 


5 










Glu Leu Pro Gin 


Asp 


Phe 


Leu 


Arg 


He 




20 










Arg Gin Val Gin 


Leu 


Asp 


Ala 


Gin 


Ala 


35 










Gly Gly Ala Val 


Gly 


Thr 


Val 


Gly Arg 


50 










Gin Ala Lys Leu 


Ala 


Lys 


Asn 


Tyr 


Gly 




65 










Tyr Cys Arg Leu 


Arg 


Leu 


Gly 


Tyr 


Ala 


80 










Ala His Asn Gly Ala 


Lys 


Asn 


Pro 


Arg 




95 










Cys Thr Val Pro 


Pro 


Gly Val 


Asp 


Ser 


110 










Asp Glu Arg Ala 


Phe 


Ser 


Met 


Asp 


Asp 


125 










lie Thr lie Pro 


Glu 


Ser 


Leu 


Arg 


Gin 




140 










Trp Tyr Ser Leu 


Ser 


Gly Arg 


Gin Gly 


155 










lie Asn Leu Val 


Met 


Ser 


Tyr 


Ala 


Leu 




170 










Met Pro Pro Gin 


Pro 


Val 


Val 


Leu 


Met 




185 










Gly Val Gly Tyr 


Val 


Pro 


He 


Thr 


Gly 


200 










Pro Gly Met Val 


Pro 


Val 


Ala 


Leu 


Pro 


215 










Gin Pro Arg Cys 


Ser 


Glu 


Glu 


Asp 


Leu 




230 










Phe Pro Asn Met 


Asp 


Gin 


Glu 


Val 


He 




245 










Gin Arg Gly Asn 


Lys 


Asp 


Ala 


Ala 


He 




260 










Gly Glu Glu Pro 












<210> 27 












<211> 129 












<212> PRT 












<213> Homo sapiens 










<220> 












<221> mi sc_f eature 










<223> Incyte ID 


No: 


2122241CD1 




<400> 27 












Met Arg Arg Arg 


Gly 


Glu 


He 


Asp 


Met 


1 


5 










Glu Leu Glu Leu 


Glu 


Thr 


Glu 


Thr 


Ser 




20 










Glu Lys Pro Arg 


Lys 


His 


Asp 


Ser 


Gly 




35 










Val Thr Asp Tyr 


Ala 


Glu 


Glu 


Lys 


Glu 




50 










Glu Thr Ala Met 


Ser 


Val 


He 


Gly Asp 



PCT/US00/16636 



Gly 


Pro 


Val 


Tyr 


He 


Gly 


10 










15 


Thr 


Pro 


Thr 


Gin 


Gin 


Gin 


25 










30 


Ala 


Gin 


Gin 


Leu 


Gin 


Tyr 


40 










45 


Leu 


Asn 


He 


Thr 


Val 


Val 


55 










60 


Met 


Thr 


Arg 


Met 


Asp 


Pro 


70 










75 


Val 


Tyr 


Glu 


Thr 


Pro 


Thr 


85 








90 


Trp 


Asn 


Lys 


Val 


He 


His 


100 










105 


Phe 


Tyr 


Leu 


Glu 


He 


Phe 


115 










120 


Arg 


He 


Ala 


Trp 


Thr 


His 


130 










135 


Gly Lys Val 


Glu Asp 


Lys 


145 










150 


Asp 


Asp 


Lys 


Glu 


Gly Met 


160 










165 


Leu 


Pro 


Ala 


Ala 


Met 


Val 


175 










180 


Pro 


Thr 


Val 


Tyr 


Gin 


Gin 


190 








195 


Met 


Pro 


Ala 


Val 


Cys 


Ser 


205 










210 


Pro 


Ala 


Ala 


Val 


Asn 


Ala 


220 










225 


Lys 


Ala 


He 


Gin 


Asp 


Met 


235 










240 


Arg 


Ser 


Val 


Leu 


Glu 


Ala 


250 










255 


Asn 


Ser 


Leu 


Leu 


Gin 


Met 


265 










270 



Ala Thr 


Glu 


Gly 


Asp 


Val 


10 








15 


Gly Pro 


Glu 


Arg 


Pro 


Pro 


25 








30 


Ala Ala Asp 


Leu 


Glu 


Arg 


40 








45 


He Gin 


Ser 


Ser 


Asn 


Leu 


55 








60 


Arg Arg 


Ser 


Arg 


Glu 


Gin 
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65 










70 








75 


Lys Ala Lys Gin 


Glu 


Arg 


Glu 


Lys 


Glu 


Leu 


Ala 


Lys 


Val Thr 


He 


80 










85 








90 


Lys Lys Glu Asp 


Leu 


Glu 


Leu 


He 


Met 


Thr 


Glu 


Met 


Glu He 


Ser 


95 










100 








105 


Arg Ala Ala Ala 


Glu 


Arg 


Ser 


Leu 


Arg 


Glu 


His 


Met 


Gly Asn 


Val 


110 








115 








120 


Val Glu Ala Leu 


He 
125 


Ala 


Leu 


Thr 


Asn 













<210> 28 
<211> 626 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2580428CD1 

<400> 28 



Met 


Gin 


Arg 


Ala 


Asp 


Ser 


Glu 


Gin 


Pro 


Ser 


Lys 


Arg 


Pro 


Arg 


Cys 


1 






5 










10 










15 


Asp 


Asp 


Ser 


Pro 


Arg 


Thr 


Pro 


Ser 


Asn 


Thr 


Pro 


Ser 


Ala 


Glu 


Ala 






20 










25 










30 


Asp 


Tro 


Ser 


Pro 


Gly 


Leu 


Glu 


Leu 


His 


Pro 


Asp 


Tyr 


Lys 


Thr 


Trp 






35 










40 










45 


Gly 


Pro 


Glu 


Gin 


Val 


Cys 


Ser 


Phe 


Leu 


Arg 


Arg 


Gly Gly 


Phe 


Glu 








50 










55 










60 


Glu 


Pro 


Val 


Leu 


Leu 


Lys 


Asn 


He 


Arg 


Glu 


Asn 


Glu 


He 


Thr 


Gly 










65 








70 










75 


Ala 


Leu 


Leu 


Pro 


Cys 
80 


Leu 


Asp 


Glu 


Ser 


Arg 
85 


Phe 


Glu 


Asn 


Leu 


Gly 
90 


Val 


Ser 


Ser 


Leu 


Gly 
95 


Glu 


Arg 


Lys 


Lys 


Leu 
100 


Leu 


Ser 


Tyr 


He 


Gin 
105 


Arg 


Leu 


Val 


Gin 


He 


His 


Val 


Asp 


Thr 


Met 


Lys 


Val 


He 


Asn 


Asp 








110 










115 










120 


Pro 


He 


His 


Gly 


His 


He 


Glu 


Leu 


His 


Pro 


Leu 


Leu 


Val 


Arg 


He 








125 










130 










13 5 


He 


Asp 


Thr 


Pro 


Gin 


Phe 


Gin 


Arg 


Leu 


Arg 


Tyr 


He 


Lys 


Gin 


Leu 








140 










145 










150 


Gly 


Gly 


Gly 


Tyr 


Tyr 


Val 


Phe 


Pro 


Gly 


Ala 


Ser 


His 


Asn 


Arg 


Phe 






155 










160 










165 


Glu 


His 


Ser 


Leu 


Gly 
17 0 


Val 


Gly 


Tyr 


Leu 


Ala 
175 


Gly 


Cys 


Leu 


Val 


His 
180 


Ala 


Leu 


Gly 


Glu 


Lys 


Gin 


Pro 


Glu 


Leu 


Gin 


He 


Ser 


Glu 


Arg 


Asp 








185 










190 










195 


Val 


Leu 


Cys 


Val 


Gin 


He 


Ala 


Gly 


Leu 


Cys 


His 


Asp 


Leu Gly 


His 








200 










205 










210 


Gly 


Pro 


Phe 


Ser 


His 


Met 


Phe 


Asp 


Gly 


Arg 


Phe 


He 


Pro 


Leu 


Ala 








215 










220 










225 


Arg 


Pro 


Glu 


Val 


Lys 


Trp 


Thr 


His 


Glu 


Gin 


Gly 


Ser 


Val 


Met 


Met 








230 










235 










240 


Phe 


Glu 


His 


Leu 


He 


Asn 


Ser 


Asn 


Gly 


He 


Lys 


Pro 


Val 


Met 


Glu 










245 








250 










255 


Gin 


Tyr 


Gly 


Leu 


He 


Pro 


Glu 


Glu 


Asp 


He 


Cys 


Phe 


He 


Lys 


Glu 






260 










265 










270 


Gin 


He 


Val 


Gly 


Pro 


Leu 


Glu 


Ser 


Pro 


Val 


Glu Asp 


Ser 


Leu 


Trp 








275 










280 










285 


Pro 


Tyr 


Lys 


Gly 


Arg 


Pro 


GlU 


Asn 


Lys 


Ser 


Phe 


Leu 


Tyr 


Glu 


He 








290 










295 










300 


Val 


Ser 


Asn 


Lys 


Arg 
305 


Asn 


Gly 


He 


Asp 


Val 
310 


Asp 


Lys 


Trp 


Asp 


Tyr 
315 


Phe 


Ala 


Arg 


Asp 


Cys 
320 


His 


His 


Leu 


Gly 


He 
325 


Gin 


Asn 


Asn 


Phe 


Asp 
330 


Tyr 


Lys 


Arg 


Phe 


He 


Lys 


Phe 


Ala 


Arg 


Val 


Cys 


Glu 


Val 


Asp 


Asn 






335 










340 










345 


Glu 


Leu 


Arg 


He 


Cys 


Ala 


Arg 


Asp 


Lys 


Glu 


Val 


Gly Asn 


Leu 


Tyr 



350 355 360 
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Asp Met 


Phe 


His 


Thr 




Asn 


Ser 


Leu 








365 










His Lys 


Val 


Gly Asn 


He 


He 


Asp 


Thr 








380 










Leu Lys 


Ala 


Asp 


Asp 


Tyr 


He 


Glu 


He 








395 










Lys Tyr 


Arg 


He 


Ser 


Thr 


Ala 


He 


Asp 








410 










Lys Leu 


Thr 


Asp 


Asn 


He 


Phe 


Leu 


Glu 




425 










Pro Lys 


Leu 


Lys 


Asp Ala Arg 


Glu 


He 








440 










Arg Asn 


Leu 


Phe 


Lvs 


TVX 

* J: 


Val 


Gly Glu 








455 










lie Lys 


lie 


Lys 


Arcr 


Glu 


Asp 


Tvr 


Glu 








470 










Ala Ser 


Ala 


Lys 


Pro 


Lvs 


Val 


Leu 


Leu 








485 










Glu Asp 


Phe 


He 


Val 


Asp 


Val 


He 


Asn 








500 










Glu Lys 


Asn 


Pro 


He 


Asp 


His 


Val 


Ser 








515 










Pro Asn 


Arg 


Ala 


He 


Arg 


He 


Thr 


Lys 








53 0 










Leu Pro 


Glu 


Lvs 


Phe 


Ala 


Glu 


Gin 


Leu 








545 










Lys Val 


Asp 


Arcr 


Lys 


Ser 


Leu 


Tyr 


Ala 








560 










Gin Trp 


Cys 


Ala 


Asp 


Arg 


Asn 


Phe 


Thr 








575 










Val lie 


Ala 


Pro 


Leu 


He 


Thr 


Pro 


Gin 








590 










Ser Thr 


Ser 


Val 


Gin 


Asn 


Pro 


Thr 


Arg 








605 








Ser Arg 


Val 


Gin 


Leu 


Phe 


Lys 


Asp 


Asp 








620 










<210> 29 














<211> 157 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> mis cofeature 










<223> Incyte ID 


NO: 


3397189CD1 




<400> 29 














Met Ala 


Pro 


Lys 


Lys 


Leu 


Ser 


Cys 


Leu 


1 






5 










Leu Ser 


Leu 


Thr 


Leu 


Leu 


Leu 


Pro 


Gin 








20 










Val Val 


Asp 


Arg 


Gly 


His 


Asp 


Arg 


Phe 








35 










Phe Arg 


Tyr 


Val 


Ser 


Gly 


Ser 


Leu 


His 








50 










Val Leu 


Trp 


Ala 


Asp 


Arg 


Leu 


Leu 


Lys 








65 








Asn Ala 


He 


Gin 


Phe 


Tyr 


Val 


Pro 


Trp 








80 










Pro Gly 


Val 


Tyr 


Asn 


Phe 


Asn 


Gly 


Ser 








95 








Leu Asn 


Glu 


Ala 


Ala 


Leu 


Ala 


Asn 


Leu 








110 










Gly Pro 


Tyr 


He 


Cys 


Ala 


Glu 


Trp 


Glu 








125 










Trp Leu 


Leu 


Arg 


Lys 


Pro 


Glu 


He 


His 








140 










Gly Glu 


Leu 


Arg 


Gin 


Arg 


He 







PCT/US00/16636 



His 


Arg Arg Ala Tyr 


Gin 


370 








375 


Met 


He Thr Asp Ala 


Phe 


385 








390 


Thr 


Gly Ala Gly Gly 


Lys 


400 








405 


Asp 


Met 


Glu 


Ala Tyr 


Thr 


415 








420 


He 


Leu 


Tvr 


Ser Thr 


Asp 


430 








435 


Leu 


Lvs 


Gin 


He Glu 


Tyr 


445 








450 


Thr 


Gin 


Pro 


Thr Gly 


Gin 


460 






465 


Ser 


Leu 


Pro 


Lvs Glu 


Val 


475 








480 


Asp 


Val 


Lys 


Leu Lys 


Ala 


490 








495 


Met 


Asp 


Tyr 


Gly Met 


Gin 


505 








510 


Phe 


Tvrr 


Cys 


Lys Thr 


Ala 


520 








525 


Asn 


Gin 


Val 


Ser Gin 


Leu 


535 








540 


He 


Arg 


Val 


Tyr Cys 


Lys 


550 








555 


Ala 


Arg 


Gin Tyr Phe 


Val 


565 








570 


Lvs 


Pro 


Gin Asp Gly 


Asp 


580 








585 


Lys 


Lys 


Glu 


Trp Asn 


Asp 


595 








600 


Leu 


Arg 


Glu 


Ala Ser 


Lys 


610 








615 


Pro 


Met 








625 











Arg 


Ser 


Leu 


Leu 


Leu 


Pro 


10 










15 


Ala 


Asp 


Thr 


Arg 


Ser 


Phe 


25 










30 


Leu 


Leu Asp Gly Ala 


Pro 


40 










45 


Tyr 


Phe 


Arg Val 


Pro 


Arg 


55 










60 


Met 


Arg 


Trp 


Ser 


Gly 


Leu 


70 










75 


Asn 


Tyr 


His 


Glu 


Pro 


Gin 


85 










90 


Arg 


Asp 


Leu 


He 


Ala 


Phe 


100 










105 


Leu 


Val 


He 


Leu 


Arg 


Pro 


115 










120 


Met 


Gly Gly 


Leu 


Pro 


Ser 


130 










13 5 


Leu 


Arg 


Thr 


Ser 


Asp 


Pro 


145 










150 
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<210> 30 

<211> 383 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 4881249CD1 

<400> 30 



Met 


Leu 


Ser 


Arg 


Lys 


Lys 


Thr 


Lys 


Asn 


Glu 


Val 


Ser 


Lys 


Pro 


Ala 


1 








5 










10 










15 


Glu 


Val 


Gin Gly 


Lys 


Tyr 


Val 


Lys 


Lys 


Glu 


Thr 


Ser 


Pro 


Leu 


Leu 










20 










25 










30 


Ara 


Asn 


Leu 


Met 


Pro 


Ser 


Phe 


He Arg 


His 


Gly 


Pro 


Thr 


He 


Pro 








35 










40 










45 


Ar or 


Ara 


Thr 


Asp 


He 


Cys 


Leu 


Pro 


Asp 


Ser 


Ser 


Pro 


Asn 


Ala 


Phe 




50 










55 










60 


Ser 


Thr 


Ser 


Gly 


Asp 


Val 


Val 


Ser 


Arg 


Asn 


Gin 


Ser 


Phe 


Leu 


Arg 








65 










70 










75 


Thr 


Pro 


He 


Gin 


Ara 
80 


Thr 


Pro 


His 


Glu 


He 
85 


Met 


Arg 


Arg 


Glu 


Ser 
90 


Asn 


Ara 


Leu 


Ser 


Ala 


Pro 


Ser 


Tyr 


Leu 


Ala 


Arg 


Ser 


Leu 


Ala 


Asp 








95 








100 










105 


Val 


Pro 


Arg 


Glu 


TVX 


Gly Ser 


Ser 


Gin 


Ser 


Phe 


Val 


Thr 


Glu 


Val 








110 










115 










120 


Ser* 


Phe 


Ala 


Val 


Glu 


Asn 


Gly Asp 


Ser 


Gly 


Ser 


Arg 


Tyr 


Tyr 


Tyr 










125 










130 










135 


Ser 


Asp 


Asn 


Phe 


Phe 


Asp 


Gly Gin Arg 


Lys 


Arg 


Pro 


Leu 


Gly 


Asp 








140 










145 










150 


Arg 


Ala 


His 


Glu 


Asp 
155 


Tyr 


Arg 


Tyr 


Tyr 


Glu 
160 


Tyr 


Asn 


His 


Asp 


Leu 
165 


Phe 


Gin 


Arg 


Met 


Pro 


Gin 


Asn 


Gin 


Gly 


Arg 


His 


Ala 


Ser 


Gly 


He 








170 










175 










180 


Glv 


Ara 


Val 


Ala 


Ala 


Thr 


Ser 


Leu 


Gly 


Asn 


Leu 


Thr 


Asn 


His 


Gly 






185 










190 










195 


Ser 


Glu 


Asp 


Leu 


Pro 


Leu 


Pro 


Pro 


Gly 


Trp 


Ser 


Val 


Asp 


Trp 


Thr 








200 










205 










210 


Met 


Arg 


Gly Arg 


Lys 


Tyr 


Tyr 


He 


Asp 


His 


Asn 


Thr 


Asn 


Thr 


Thr 










215 










220 










225 


His 


Trp 


Ser 


His 


Pro 


Leu 


Glu 


Arg 


Glu 


Gly 


Leu 


Pro 


Pro 


Gly 


Trp 








230 










235 










240 


Glu 


Arg 


Val 


GlU 


Ser 


Ser 


Glu 


Phe Gly Thr 


Tyr 


Tyr 


Val 


Asp 


His 








245 










250 










255 


Thr 


Asn 


Lys 


Lys 


Ala 


Gin 


Tyr 


Arg 


His 


Pro 


Cys 


Ala 


Pro 


Ser 


Val 








260 










265 










270 


Pro 


Arg 


Tyr Asp 


Gin 


Pro 


Pro 


Pro 


Val 


Thr 


Tyr 


Gin 


Pro 


Gin 


Gin 








275 










280 










285 


Thr 


Glu 


Arg Asn 


Gin 


Ser 


Leu 


Leu 


Val 


Pro 


Ala 


Asn 


Pro 


Tyr 


His 










290 










295 










300 


Thr 


Ala 


Glu 


He 


Pro 


Asp 


Trp 


Leu 


Gin 


Val 


Tyr Ala Arg Ala 


Pro 










305 






310 










315 


Val 


Lys 


Tyr Asp 


His 


He 


Leu 


Lys 


Trp 


Glu 


Leu 


Phe 


Gin 


Leu 


Ala 










320 










325 










330 


Asp 


Leu 


Asp 


Thr 


Tyr 


Gin Gly Met 


Leu 


Lys 


Leu 


Leu 


Phe 


Met 


Lys 








335 










340 










345 


Glu 


Leu 


Glu 


Gin 


He 


Val 


Lys 


Met 


Tyr 


Glu 


Ala 


Tyr 


Arg 


Gin 


Ala 










350 








355 










360 


Leu 


Leu 


Thr 


Glu 


Leu 


Glu 


Asn 


Arg 


Lys 


Gin 


Arg Gin Gin Trp 


Tyr 










365 










370 










375 


Ala 


Gin 


Gin 


His 


Gly 
380 


Lys 


Asn 


Phe 

















<210> 31 

<211> 478 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 43187 1CD1 

<400> 31 



Met 


Asp 


Thr 


Ser 


Asp 
5 


Leu 


Phe 


Ala 


J. 

Gly Arg Val 


Arg 


Tvr 


Leu 


Leu 


Glu 










20 








Val 


Arg Asp 


Lvs 


Trp 


Asp 


Ser 


Thr 










35 








Cys Gly His 


GlU 


Glu 


Leu 


Val 


Leu 










50 








Arg 


Cys 


Glu 


Ala 


Asn 


Thr 


Phe 


Asp 






65 








Ala 


Leu 


Ser 


Asp 


Pro 


He 


Arg 


Arg 










80 








Val 


Thr 


Ala 


Ser 


Cvs 


Arcr 


Arg 


Ara 










95 








Gin 


Arg 


Leu 


Leu 


Glu 


Gin 


Glv 


He 








110 








Val 


His 


Gly 


Lys 


Pro 


Phe 


Arg 


Val 








125 








Arg 


Ser 


Ala 


Jf 


Phe 


Ala 


Asn 


Met 








140 








Lys 


Ser 


Val 


Val 


Val 
155 


Leu 


Arg 


His 


Phe 


Gly Ala 


Leu 


Leu 


Gin 


Tvr 


Leu 










170 








Gly Val 


(31 u 


His 


Val 


Ser 


Asp 


Cys 










185 








Gin 


Leu 


Trp 


Asp 


Leu 


Leu 


Ser 


Asp 








200 








Val 


Ser 


Glu 


Phe 


Val 


Ala 


Ser 


Lys 










215 








Leu 


Thr 


He 


Glu 


Pro 


Pro 


Pro 


Ala 










23 0 








Met 


Ala 


Leu 


Leu 


Ala 


Asp 


Cvs 


Ala 










245 








Asp 


Leu 


Trp 


Glu 


Leu 


Pro 


Phe 


Pro 








260 








Cys 


Pro 


Asp 


He 


Cvs 


Phe 


Aror 


Val 








275 








His 


Lys 


Ala 


Phe 


Phe 


Cys 


Glv 


Arcr 








290 








Leu 


Asp 


Asp 


His 


Phe 


Aro: 


Glu 


Ser 








305 








Gly 


Pro 


Pro 


Ala 


Val 


Thr 


Leu 


His 








320 








Thr 


His 


Val 


Leu 


TVr 


Tvr 


Met 


Tyr 










335 








Pro 


Glu 


Ala 


Ala 


Tvr 


Asp 


Val 


Leu 










350 








Leu 


Pro 


Gly 


Leu 


Lvs 


Arg 


Leu 


Cys 










365 








Leu 


Asp 


Glu 


Asp 


Thr 


Val 


Val 


Gly 










380 








Phe 


Arg 


Leu 


Ala 


Arg 


Leu 


Glu 


Asp 










395 








Lys 


Val 


He 


Glu 


Lys 


Leu 


Val 


Glu 








410 








Val 


Lys 


Glu 


Glu 


Ala 


Ala 


Ala 


Val 








425 








Ser 


He 


Pro 


Leu 


Val 


Asp 


Asp 


He 










440 








Val 


Gin 


Thr 


Tyr 


Ser 


Ala 


He 


Glu 



455 



Ser 


Cys 


Arg 


Lys 


Gly Asp 


val 




10 










15 


Gin 


Arg 


Asp 


Val 


Glu 


Val 


Asn 




25 








30 


Pro 


Leu 


Tyr 


Tyr 


Ala 


Cys 


Leu 




40 








45 


Tyr 


Leu 


Leu Ala Asn Gly Ala 


55 










60 


Gly 


Glu 


Arg 


Cys 


Leu Tyr Gly 


70 










75 


Ala 


Leu 


Arg Asp 


Tyr 


Lys 


Gin 




85 










90 


Asp 


Tyr 


Tyr 


Asp 


Asp 


Phe 


Leu 




100 










105 


His 


Ser 


Asp Val Val 


Phe 


Val 




115 










120 


His 


Arg 


Cys 


Val 


Leu 


Gly Ala 




13 0 










135 


Leu 


Asp 


Thr 


Lys 


Trp 


Lys 


Glv 




145 










150 


Pro 


Leu 


He 


Asn 


Pro 


Val 


Ala 




160 










165 


Tvr 


Thr 


Gly Arg 


Leu Asp 


He 




175 










180 


Glu 


Arg 


Leu 


Ala 


Lys 


Gin 


Cvs 




190 










195 


Leu 


Glu 


Ala 


Lys 


Cys 


Glu 


Lvs 




205 










210 


Pro 


Gly 


Thr 


Cys 


Val 


Lys 


Val 




220 










225 


Asp 


Pro 


Arg 


Leu 


Arg 


Glu 


Asp 


235 










240 


Leu 


Pro 


Pro 


Glu 


Leu Arg 


Gly 




250 










255 


Cvs 


Pro 


Asp 


Gly 


Phe 


Asn 


Ser 


265 










270 


Ala 


Gly 


Cys 


Ser 


Phe 


Leu 


cys 




280 










285 


Ser 


Asp 


Tyr 


Phe 


Arg 


Ala 


Leu 




295 










300 


Glu 


Glu 


Pro 


Ala 


Thr 


Ser 


Gly 




310 










315 


Gly 


He 


Ser 


Pro 


Asp 


Val 


Phe 




325 










330 


Ser 


Asp 


His 


Thr 


GlU 


Leu 


Ser 




340 










345 


Ser 


Val 


Ala Asp 


Met 


Tyr 


Leu 




355 










360 


Gly Arg 


Ser 


Leu 


Ala 


Gin 


Met 




370 










375 


Val 


Trp 


Arg 


Val 


Ala 


Lys 


Leu 




385 










390 


Gin 


Cys 


Thr 


Glu 


Tyr 


Met 


Ala 




400 










405 


Arg 


Glu 


Asp 


Phe 


Val 


Glu 


Ala 


415 








420 


Ala 


Ala 


Arg 


Gin 


Glu 


Thr 


Asp 




430 










435 


Arg 


Phe 


His 


Val 


Ala 


Ser 


Thr 


445 










450 


Glu 


Ala 


Gin 


Gin 


Arg 


Leu 


Arg 




460 










465 
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# 



Ala Leu Glu Asp Leu lSu Val Ser lie Gly Leu Asp Cys 
470 475 

<210> 32 

<211> 275 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 526155CD1 

<400> 32 

Met Ser Ala Glu Val Lys Val Thr Gly Gin Asn Gin Glu Gin Phe 
1 5 10 15 

Leu Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu 
20 25 30 

lie His Gin Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu 
35 40 45 

Leu Leu Asp Met Pro Asn Val Arg Glu Leu Ala Glu Ser Asp Phe 
50 55 60 

Ala Ser Thr Phe Arg Leu Leu Thr Val Phe Ala Tyr Gly Thr Tyr 
65 70 75 

Ala Asp Tyr Leu Ala Glu Ala Arg Asn Leu Pro Pro Leu Thr Glu 
80 85 90 

Ala Gin Lys Asn Lys Leu Arg His Leu Ser Val Val Thr Leu Ala 
95 100 105 

Ala Lys Val Lys Cys lie Pro Tyr Ala Val Leu Leu Glu Ala Leu 

110 115 120 

Ala Leu Arg Asn Val Arg Gin Leu Glu Asp Leu Val lie Glu Ala 

125 130 135 

Val Tyr Ala Asp Val Leu Arg Gly Ser Leu Asp Gin Arg Asn Gin 

140 145 150 

Arg Leu Glu Val Asp Tyr Ser lie Gly Arg Asp He Gin Arg Gin 

155 160 165 

Asp Leu Ser Ala He Ala Arg Thr Leu Gin Glu Trp Cys Val Gly 

170 175 180 

Cys Glu Val Val Leu Ser Gly He Glu Glu Gin Val Ser Arg Ala 

185 190 195 

Asn Gin His Lys Glu Gin Gin Leu Gly Leu Lys Gin Gin He Glu 

200 205 210 

Ser Glu Val Ala Asn Leu Lys Lys Thr He Lys Val Thr Thr Ala 

215 220 225 

Ala Ala Ala Ala Ala Thr Ser Gin Asp Pro Glu Gin His Leu Thr 

230 235 240 

Glu Leu Arg Glu Pro Ala Pro Gly Thr Asn Gin Arg Gin Pro Ser 

245 250 255 

Lys Lys Ala Ser Lys Gly Lys Gly Leu Arg Gly Ser Ala Lys He 

260 ~ 265 270 

Trp Ser Lys Ser Asn 

275 

<210> 33 

<211> 217 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 676234CD1 

<400> 33 

Met Ala Ser Thr Gly Leu Glu Leu 

1 5 
Leu Gly Trp Leu Gly Thr Leu Val 
20 

Lys Val Thr Ala Phe He Gly Asn 



Leu Gly Met Thr Leu Ala Val 
10 15 

Ser Cys Ala Leu Pro Leu Trp 
25 30 

Ser He Val Val Ala Gin Val 
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35 










Val Trp 


Glu 


Gly 


Leu 


Trp 


Met 


Ser 


Cys 






50 










Gin Met 


Gin 


Cys 


Liys 


vai 


Tyr 


7V — . M 

ASp 


Ser 








65 










Asp Leu 


Gin 


Ala 


Ala 


Arg 


Ala 


Leu 


Cys 






80 










Ala Leu 


Leu 


Gly 


Leu 


Lieu 


vai 


Ala 


He 






95 










Thr Cys 


Val 


Glu 


Asp 


Glu 


Gly 


Ala 


Lys 








tin 

110 










Ala Gly 


Val 


lie 


Leu 


Leu 


Leu 


Ala Gly 








"IOC 

125 










Val Cys 


Trp 


Thr 


Ala 


HIS 


7V 1 — , 

Ala 


He 


He 








140 










Leu Val 


Ala 


Glu 


Ala 


Leu 


Lys 


Arg 


Glu 








"ICC 

155 










Leu Gly 


Trp 


Ala 


Ala 


TV 1 — , 

Ala 


7V T . 

Ala 


Leu 


Leu 




170 










Leu Cys 


Cys 


Thr 


Cys 


Pro 


Pro 


Pro 


Gin 






lob 










Pro Arg 


Leu 


Gly 


Tyr 


Ser 


He 


Pro 


Ser 








200 










Leu Asp 


Lys 


Arg 


Asp 


Tyr 


Val 










215 










<210> 34 














<211> 74 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID 


No: 


720145CD1 




<400> 34 














Met Asp 


Asp 


Tyr 


Thr 


Ser 


Ala 


He 


Glu 


1 






5 










Val Pro 


Tyr 


Tyr 


Asn 


Arg Gly Leu 


He 








20 










Phe Asp 


Asp 


Ala 


Leu 


Glu 


Asp 


Phe 


Lys 






35 










Pro Gly 


Phe 


Gin 


Asp 


Ala 


Thr 


Leu 


Ser 






50 










Asp Lys 


Glu 


Glu 


Lys 


Gin 


Arg Arg Asn 








65 










<210> 35 














<211> 367 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_f eature 










<223> Incyte ID 


No : 


1001951CD1 




<400> 35 














Met Val 


Gin 


Gin 


Phe 


Leu 


Arg 


Gin 


Ala 


1 






5 










Lys Glu 


Arg 


Glu 


Gly 


Ala 


Leu 


Val 


Ser 






20 










His Pro 


Glu 


Thr 


Gin 


Gin 


Thr 


Phe 


He 








35 










His Trp 


Val 


Thr 


Leu 


He 


Val 


Glu 


Ser 






50 










Leu Leu 


Pro 


Gin 


Gly 
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Pro Asn Asn 
Ala Leu Ala 
Val He Lys 
Gin Asn Lys 
Phe Phe Lys 
Arg Gly He 
Arg Ala Met 
Lys Asp Cys 
Ala Phe Ala 
Asn Glu Ala 
Gly Asn Lys 
Leu He Glu 
Gin Arg Gin 
Gly Ser Thr 
Asn Leu He 
Ala Pro Glu 
Thr Gly Thr 
Pro Thr Ser 
Val Ser Asp 
Asp Val Cys 
Gin Lys Pro 
Ala Asn Ser 
Ser Pro Asp 
Leu Tyr Pro 
Asn Gin He 



100 

Leu Ser Gin 
115 

Ser Gin Lys 
13 0 

Lys Gin Gly 
145 

Met Asp Ala 
160 

Ser Ala Tyr 
175 

Cys Asn Leu 
190 

Ser Arg Arg 
205 

Ala Lys Lys 
220 

Phe Glu Ala 
235 

Ser Lys Glu 
250 

Ser Thr Glu 
265 

Gin Gin Ala 
280 

Glu Gly Lys 
295 

Ala Ala Asp 
310 

Ala Tyr Leu 
325 

Thr Gin Ala 
340 

Arg Arg Gly 
355 

Lys Gin Asp 
370 

Gin Ala Val 
385 

Lys Gly His 
400 

Asn Val Val 
415 

Lys Pro Leu 
430 

Gin Thr He 
445 

Asn Asn Pro 
460 

Thr Ser Lys 
475 

Asp Thr Pro 
490 

Thr Ser Ser 
505 

Gin Ser Tyr 
520 

Ala Gin Phe 
535 

Phe Gin Leu 
550 

Met Leu Tyr 
565 

Lys Leu Phe 
580 

Val Lys He 
595 



105 

Glu Ser Glu 
120 

Ala Leu Val 
135 

Lys Tyr Asp 
150 

Asp Pro Tyr 
165 

Phe Arg Leu 
180 

Ala Val Ala 
195 

Gly Ala Ala 
210 

Asp Tyr Glu 
225 

Thr Asn Glu 
240 

Asn Ser Tyr 
255 

Gly Glu Arg 
270 

He Ser Glu 
285 

Tyr Glu Arg 
300 

Gly Ala Asn 
315 

Lys He Gin 
330 

He Leu Leu 
345 

Thr Ala Arg 
360 

Phe Glu Thr 
375 

Thr Glu Leu 
390 

Trp Asp Asp 
405 

Lys Pro He 
420 

Lys Lys Val 
43 5 

Asp Val Pro 
450 

He Asn Leu 
465 

Lys Asn Ser 
480 

Arg Ala Lys 
495 

Leu Gin Pro 
510 

Ser Glu Lys 
525 

Ala Thr Thr 
540 

Glu Ser Asp 
555 

Gin Tyr Leu 
570 

Gin Lys Asn 
585 

Leu His Asp 
600 
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Phe 


Tyr 


He 


Glu 


Lys 


Glu 


Lys 


Pro 








sr r\ r* 

605 








Gin Arg 


Leu 


Ser 


GlU 


Leu 


Lys 


Arg 










620 








Met 


Ser 


Glu 


Thr 


GlU 
635 


Lys 


Lys 


He 


He 


Asp 


Lys 


Ser 


Gly 
650 


Leu 


Lys 


Asp 


Lys 


Arg 


Tyr 


Gly 


Gly 














665 









<210> 40 
<211> 125 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 1688972CD1 

<400> 40 



Met 


Leu 


Asp 


Leu 


Gin 


Lys 


Gin 


Leu 


1 






5 








val 


Gly 


Thr 


Thr 


GlU 


Pro 


Ala 


Ser 








20 








Pro 


Thr 


Thr 


Ala 


Ala 
35 


Ala 


Thr 


Met 


Ser 


Leu 


Ala 


Pro 


Pro 
50 


Gly 


Glu 


Ala 


Ala 


Pro 


Pro 


Ala 


Ser 
65 


Gly 


Thr 


Arg 


Tyr 


Glu 


Ala 


Ala Asp 


Ser 


Ser 


Glu 








80 








Leu 


He 


Thr 


Val 


Tyr 
95 


Ser 


Leu 


Pro 


He 


Gly 


Glu 


Arg 


Gly 
110 


Asn 


Lys 


Lys 


Leu 


Glu 


Leu 


Leu 


Ser 
125 









<210> 41 
<211> 366 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 1812494CD1 

<400> 41 



Met 


Cys 


Tyr 


Phe 


Tyr 


Leu 


Gly 


Asp 


1 








5 








Gin 


Ala 


Phe 


He 


Leu 
20 


Met 


His 


Leu 


Leu 


Asp 


Gly 


Phe 


His 
35 


Met 


Ser 


Gly 


Glu 


Gly 


GlU 


Asp 


Gly 
50 


Phe 


Gin 


Val 


Thr 


Val 


Gly 


Thr 


Leu 
65 


Asp 


Leu 


Cys 


Arg 


Arg 


Arg 


Arg 


Lys 
80 


Arg 


Asn 


Lys 


GlU 


Ala 


Gly 


Ala 


His 
95 


Arg 


Thr 


Leu 


Glu 


Pro 


Ser 


Thr 


Gin 
110 


Ser 


Ser 


Gin 


Pro 


Leu 


Leu 


Asp 


Glu 
125 


Ala 


Lys 


Ala 


Asn 


Ala 


Leu 


Leu 


Ala 


Ala 


Cys 


Arg 



Leu 


Leu 


He 


Pne 


GlU 


lie 


Leu 




610 










615 


Phe 


Asp 


Met 


Ala 


Val 


Met 


Phe 




625 










630 


Ala 


Arg 


Ala 


Leu 


Phe 


Asn 


His 




640 










645 


Ser 


Ser 


Val 


Glu 


Glu 


Leu 


Lys 




ODD 










DDU 


Gly Arg 


file 


Pro 


Gly Thr 


±rne 




10 










1 c: 

X D 


Pro 


Pro 


Leu 


Ser 


Ser 


Thr 


Ser 




25 










j u 


Pro 


Val 


Val 


Pro 


Ser 


Val 


nla 




40 










*± D 


Ser 


Leu 


Cys 


Leu 


Glu 


Glu 


V CL -L 




55 










O U 


Lys 


Ala 


Arg Val 


Leu 


Tyr 


Asp 


70 










75 


Leu 


Ala 


Leu 


Leu 


Ala 


Asp 


Glu 




85 










90 


Gly Met 


Asp 


Pro 


Asp 


Trp 


Leu 




100 










105 


Gly 


Lys 


Val 


Pro 


val 


Thr 


Tyr 




115 












Lys 


He 


Lys 


Thr 


He 


Ser 






10 










± D 


Leu 


Leu 


Pro 


Ser 


Glu 


Tyr 






25 










Phe 


Ser 


Leu Gly 


Ser 


Gly 


Ser 




40 












Glu 


Leu 


Glu 


Leu 


Val 


Glu 


Leu 




55 










fin 


Glu 


Ser 


Glu 


Val 


Leu 


Pro 






70 












Lys 


Glu 


Lys 


Ser 


Arg 


Asp 






85 












Leu 


Gin 


Gin 


Thr 


Gin 


Glu 


Glu 




100 










105 


Ala 


Val 


Ala 


Ala 


Pro 


Leu 


Gly 




115 










120 


Pro 


Gly 


Gin 


Pro 


Glu 


Leu 


Trp 




130 










135 


Ala 


Gly 


Asp 


Val 


Gly Val 


Leu 



41/82 



WO 00/77040 











1 A A 


Lys 


Leu 


Gin 


Leu 


III a 

ilia 










Leu. 


Leu 


Ser 


Ala 


Pro 

1 1 If 


Ala 


Ala 


Ala 


Ala 


Ala 
1 ft ^ 


GlU 


Ala 


Gly Ala 


Asp 










A u u 


Pro 


Tyr 


Thr 


Val 


Ala 








Z ID 


Arg 


irne 


Met 


Glu 


Lys 








9 7 0 


Gin 


Val 


Pro 


Gly 


Pro 

OA 


Thr 


Arg 


Lys 


Arg 


pin 


Gin 


Gin 


Gin Arg 


Gin 










AID 


Gin 


Arg 


Arg 


Phe 


Ala 

Z J U 


Ala 


Ala 


Glu Arg 


Arg 












Pro 


He 


Pro 


Asp 


Ser 


Cys 


Gly 


Ala 


Ser 


Leu 
335 


Phe 


Ser 


Phe 


Cys 


Ser 
350 


Ala 


Gly 


Arg 


Pro 


Ser 
365 



<210> 42 

<211> 173 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



<400> 42 



Met 


Ser 


Thr 


Met 


Gly 


1 








5 


Ser 


His 


Phe 


Asp 


Asn 
20 


Lys 


L Y S 


Leu 


Asp 


Leu 
35 


Phe 


Met 


Ser 


Leu 


Pro 
50 


Val 


He 


Asp 


Val 


Phe 
65 


Lys 


Glu 


Phe 


He 


Leu 








80 


Glu 


Glu 


Gin 


Lys 


Leu 
95 


Lys 


Asp 


Gly 


Tyr 


He 
110 


Met 


Met 


Val 


Gly Asn 










125 


Val 


Asp 


Lys 


Thr 


He 
140 


He 


Ser 


Phe 


Glu 


Glu 
155 


His 


Lys 


Lys 


Leu 


Val 
170 



<210> 43 
<211> 761 













Pro 


Ser 


Pro 


Ala 


Asp 
1 o U 


Leu 


Gly 


Ser 


Gly 


Gly 

± / D 


Gly 


Arg 


Gly 


Ser 


Val 
1 on 

1 27 U 


Pro 


Thr 


Val 


Gin 


Asp 

ZUj 


Ala 


Asp 


Lys 


Ser 


Thr 

zzu 


Asn 


Pro 


Asp 


Ala 


Tyr 

9 


Leu 


Thr 


Pro 


Glu 


Met 


Gin 


Lys 


Ala 


Ala 


Arg 
o a c 


Gin 


Glu 


Gin 


Glu 


GlU 


Ala 


Leu 


Ser 


Asp 


Arg 


Leu 


Ala 


Ala 


Gin 


Leu 


Ala 


lie 


XT-. 1 

vai 


Asn 


Thr 
325 


Gin 


Gly 


Leu 


Thr 


Pro 
340 


Thr 


Arg 


Cys 


Leu 


Gin 
355 


Ser 











2013853CD1 



Asn 


Glu 


Ala 


Ser 


Tyr 










10 


Asp 


Glu 


He 


Lys 


Arg 










25 


Asp 


Lys 


Ser 


Gly 


Ser 










40 


Glu 


Leu Arg 


His 


Asn 










55 


Asp 


Thr 


Asp 


Gly 


Asp 










70 


Gly Thr 


Ser 


Gin 


Phe 










85 


Arg 


Phe 


Ala 


Phe 


Ser 










100 


Ser Asn Gly Glu 


Leu 










115 


Asn 


Leu 


Thr 


Asp 


Trp 










130 


He 


He 


Leu Asp 


Lys 










145 


Phe 


Ser 


Ala 


Val 


Val 










160 


Leu 


He 


Val 















1 C A 

JL3 U 


-fro 


Arg 


Val 


Leu 


Ser 










lb j 


rile 


inr 


Leu 


Leu 


HIS 












Val 


A rrr 
ril y 




T.on 

Uc Li. 


T .011 








1 -7 D 


Ser 


Arg 


Ala 


Arg 


Pro 












Arg 


Asn 


«1 u 


rXIS 


Arg 










ZZ J 


Asp 


ryr 


AS Xi 


Lys 


Ala 










^5 41 u 


pin 


Ala 

Aid 


Arg 


VjlXl 


Ala 








ZDD 


Arg 


bin 


Arg 


blU 


blU 










z / u 


Arg 




Arg 


m ii 

Vj 1 LI 


f21 11 










ZOJ 




Lys 


Arg 


Ala 


Leu 










JuU 




Alex 


Pro 


1X1X7 


Q /—\ -v- 










315 


Arg 


Arg 


Cys 


Trp 


Ser 










330 


Phe 


His 


Tyr 


Leu 


Asp 










345 


Asp 


His 


Arg 


Arg 


Gin 










*2 a n 


Pro 


Aid 


blU 


weti 


Cys 










t c 
ID 


Leu 


plir 

vjiy 


Arg 


Arg 


irne 










^ n 
J u 


T .ah 




val 


VjlU 


pi n 










A tr 


Pro 


Leu 


Val 


Arg 


ai y 










D U 




pin 


Val 


Asp 


T>Vie» 










*7 ^ 


ser 


val 


Lys 


nii» 
oiy 


Asp 










q n 


lit. 




7v en 


lit- (_ 


2\ en 










iUJ 


nit: 




v ax 


T ,m i 


J_i jr o 










1 9 n 

1ZU 


Gin 


Leu 


Gin 


Gin 


Leu 










135 


Asp 


Gly 


Asp 


Gly 


Lys 










150 


Arg 


Asp 


Leu 


GlU 


He 










165 



42/82 



WO 00/77040 

<212> PRT 

<213> Homo sapiens 



# 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 2284925CD1 

<400> 43 

Met Arg Leu Thr Gin Asp Pro lie Gin Val Leu Leu lie Phe Ala 
15 10 15 

Lys Glu Asp Ser Gin Ser Asp Gly Phe Trp Trp Ala Cys Asp Arg 
20 25 30 

Ala Gly Tyr Arg Cys Asn lie Ala Arg Thr Pro Glu Ser Ala Leu 
35 40 45 

Glu Cys Phe Leu Asp Lys His His Glu lie lie Val lie Asp His 
50 55 60 

Arg Gin Thr Gin Asn Phe Asp Ala Glu Ala Val Cys Arg Ser lie 
65 70 75 

Arg Ala Thr Asn Pro Ser Glu His Thr Val lie Leu Ala Val Val 
80 85 90 

Ser Arg Val Ser Asp Asp His Glu Glu Ala Ser Val Leu Pro Leu 
95 100 105 

Leu His Ala Gly Phe Asn Arg Arg Phe Met Glu Asn Ser Ser lie 

110 115 120 

lie Ala Cys Tyr Asn Glu Leu lie Gin lie Glu His Gly Glu Val 

125 130 135 

Arg Ser Gin Phe Lys Leu Arg Ala Cys Asn Ser Val Phe Thr Ala 

140 145 150 

Leu Asp His Cys His Glu Ala lie Glu lie Thr Ser Asp Asp His 

155 160 165 

Val lie Gin Tyr Val Asn Pro Ala Phe Glu Arg Met Met Gly Tyr 

170 175 180 

His Lys Gly Glu Leu Leu Gly Lys Glu Leu Ala Asp Leu Pro Lys 

185 190 195 

Ser Asp Lys Asn Arg Ala Asp Leu Leu Asp Thr lie Asn Thr Cys 

200 205 210 

lie Lys Lys Gly Lys Glu Trp Gin Gly Val Tyr Tyr Ala Arg Arg 

215 220 225 

Lys Ser Gly Asp Ser lie Gin Gin His Val Lys lie Thr Pro Val 

230 235 240 

lie Gly Gin Gly Gly Lys lie Arg His Phe Val Ser Leu Lys Lys 

245 250 255 

Leu Cys Cys Thr Thr Asp Asn Asn Lys Gin lie His Lys lie His 

260 265 270 

Arg Asp Ser Gly Asp Asn Ser Gin Thr Glu Pro His Ser Phe Arg 

275 280 285 

Tyr Lys Asn Arg Arg Lys Glu Ser lie Asp Val Lys Ser lie Ser 

290 295 300 

Ser Arg Gly Ser Asp Ala Pro Ser Leu Gin Asn Arg Arg Tyr Pro 

305 310 315 

Ser Met Ala Arg lie His Ser Met Thr lie Glu Ala Pro lie Thr 

320 325 330 

Lys Val lie Asn lie lie Asn Ala Ala Gin Glu Asn Ser Pro Val 

335 340 345 

Thr Val Ala Glu Ala Leu Asp Arg Val Leu Glu He Leu Arg Thr 

350 355 360 

Thr Glu Leu Tyr Ser Pro Gin Leu Gly Thr Lys Asp Glu Asp Pro 

365 370 ~ 375 

His Thr Ser Asp Leu Val Gly Gly Leu Met Thr Asp Gly Leu Arg 

380 385 390 

Arg Leu Ser Gly Asn Glu Tyr Val Phe Thr Lys Asn Val His Gin 

395 400 405 

Ser His Ser His Leu Ala Met Pro He Thr He Asn Asp Val Pro 

410 415 420 

Pro Cys He Ser Gin Leu Leu Asp Asn Glu Glu Ser Trp Asp Phe 

425 430 435 

Asn He Phe Glu Leu Glu Ala He Thr His Lys Arg Pro Leu Val 
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440 








Tyr 


Leu 


Gly 


Leu 


Lys 
455 


Val 


Phe 


Ser 


Leu 


Asn 


Cys 


Ser 


Glu 


Thr 


Thr 


Leu 








470 








Glu 


Ala 


Asn 


Tyr 


His 
485 


Ser 


Ser 


Asn 


Ala 


Ala 


Asp Val 


Leu 


His 


Ala 


Thr 










500 








Arg 


Val 


Lys 


Gly 


Ser 
515 


Leu 


Asp 


Gin 


He 


Ala 


Ala 


Thr 


Val 


His 


Asp 


Val 










53 0 






Ser 


Phe 


Leu 


Cys 


Asn 


Ala 


Gly 


Ser 










545 






Asp 


Thr 


Ala 


Val 


Leu 


Glu 


Ser 


His 








560 








Leu 


Thr 


Val 


Lys 


Asp 
575 


Thr 


Lys 


Cys 


Arg 


Asn 


His 


Tyr 


Arg 
590 


Thr 


Leu 


Arg 


Leu 


Ala 


Thr 


GlU 


Met 


Thr 


Lys 


His 










605 






Val 


Asn 


Ser 


He 


Asn 


Lys 


Pro 


Met 










620 






Asp 


cys 


Glu 


Cys 


Asn 
635 


Pro 


Ala 


Gly 


He 


Leu 


He 


Lys 


Arg 
650 


Met 


Met 


He 


Pro 


Cys 


Arg 


Pro 


Leu 
665 


Asp 


Leu 


Cys 


Ser 


Glu 


Glu 


Tyr 


Phe 
680 


Ala 


Gin 


Thr 


Leu 


Pro 


Val 


Val 


Met 
695 


Pro 


Val 


Phe 


Pro 


Lys 


Ser 


Gin 


He 
710 


Ser 


Phe 


He 


Phe 


Asp 


Ala 


Trp 


Asp 
725 


Ala 


Phe 


Ala 


His 


Leu 


Ala 


Asp 


Asn 
740 


Tyr 


Lys 


His 


Lys 


Cys 


Lys 


Ser 


Leu 
755 


Arg 


Leu 


Pro 



<210> 44 
<211> 249 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2376728CD1 

<400> 44 



Met 


Val 


Asp 


Arg 


Leu 


Ala 


Asn 


Ser 


1 






5 








Ser 


He 


Val 


Glii 


Asn 


Cys 


Phe 


Gly 










20 








He 


Pro 


Gly 


Arg 


Val 


Leu 


He 


Gly 










35 








Cys 


Arg 


Lys 


Lys 


Pro 


Lys 


Ala 


Arg 










50 








He 


Leu 


Val 


Tyr 


Gly 


Asn 


He 


Val 










65 








Lys 


Gin 


His 


He 


He 


Pro 


Leu 


Glu 










80 








Lys 


Asp 


Glu 


Gly 


Asp 


Leu 


Arg 


Asn 



95 





445 










450 


Arg 


Phe 


Gly 


Val 


Cys 


Glu 


Phe 




460 










465 


Arg 


Ala 


Trp 


Phe 


Gin 


Val 


He 




475 








480 


Ala 


Tyr 


His 


Asn 


Ser 


Thr 


His 




490 










495 


Ala 


Phe 


Phe 


Leu 


Gly 


Lys 


Glu 




505 










510 


Leu 


Asp 


Glu 


Val 


Ala 


Ala 


Leu 




520 










525 


Asp 


His 


Pro 


Gly 


Arg 


Thr 


Asn 




535 










540 


Glu 


Leu 


Ala 


Val 


Leu 


Tyr 


Asn 




550 








555 


His 


Thr 


Ala 


Leu 


Ala 


Phe 


Gin 




565 










570 


Asn 


He 


Phe 


Lys 


Asn 


He 


Asp 




580 










585 


Gin 


Ala 


He 


He 


Asp 


Met 


Val 




595 








600 


Phe 


Glu 


His 


Val 


Asn 


Lys 


Phe 




610 










615 


Ala 


Ala 


Glu 


He 


Glu 


Gly 


Ser 




625 








630 


Lys 


Asn 


Phe 


Pro 


Glu 


Asn 


Gin 


640 










645 


Lys 


Cys 


Ala 


Asp 


Val 


Ala 


Asn 




655 










660 


He 


Glu 


Trp 


Ala 


Gly 


Arg 


He 




67 0 










675 


Asp 


Glu 


Glu 


Lys 


Arg 


Gin 


Gly 




685 










690 


Asp 


Arg 


Asn 


Thr 


Cys 


Ser 


He 




700 










705 


Asp 


Tyr 


Phe 


He 


Thr 


Asp 


Met 




715 










720 


His 


Leu 


Pro 


Ala 


Leu 


Met 


Gin 




730 










735 


Trp 


Lys 


Thr 


Leu 


Asp 


Asp 


Leu 




745 










750 


Ser 


Asp 


Ser 












760 












Glu 


Ala 


Asn 


Thr 


Arg 


Arg 


He 




10 










15 


Ala 


Ala 


Gly 


Gin 


Pro 


Leu 


Thr 




25 










30 


Glu 


Gly 


Val 


Leu 


Thr 


Lys 


Leu 




40 










45 


Gin 


Phe 


Phe 


Leu 


Phe 


Asn 


Asp 




55 










60 


He 


Gin 


Lys 


Lys 


Lys 


Tyr 


Asn 




70 










75 


Asn 


Val 


Thr 


He 


Asp 


Ser 


He 




85 










90 


Gly 


Trp 


Leu 


He 


Lys 


Thr 


Pro 




100 










105 
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Thr 


Lys 


Ser 


Phe 


Ala 

110 

XXV 


Glu 


Trp 


Met 


Asn 


His 








125 


Lys 


Ser 


Gly Lys 


Thr 










140 


Pro 


Asp 


Ser 


Glu 


Ala 
155 


Phe 


Thr 


Pro 


Val 


Asn 
170 


Val 


Val 


Cys 


Gly 


Pro 
185 


Gin 


Ser 


Ser 


Lys 


Pro 
200 


Leu 


Ser 


Ala 


Gly 


Asp 
215 


Ser 


Tyr 


Ser 


Gin 


Ser 
230 


Asp 


Asp 


Asp 


Asp 


Asp 
245 



<210> 45 

<211> 247 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 

<400> 45 



Met 


Glu 


Thr 


Asp 


Glu 


1 








5 


Gly Glu 


Ala 


Val 


Met 










20 


Pro 


Arg 


Glu 


Gin 


Ser 
35 


Ala 


Glu 


Val 


Met 


Asp 
50 


Leu 


Pro 


Ala 


Glu 


Asp 
65 


Ser 


Lys 


Gly 


Ser 


Phe 
80 


Cys 


Ser 


Gly His 


Ser 










95 


Arg 


Lys 


Phe 


Ser 


Pro 
110 


Ser 


Val 


Ser 


Phe 


Thr 
125 


Leu 


Tyr 


Thr 


Gly 


Ala 
140 


Leu 


Leu 


Ser 


Pro 


Val 
155 


Gly 


Ser 


Val 


Gly 


Asp 
170 


Lys 


Lys 


Asp 


Glu 


Glu 
185 


Tyr 


Ala 


Val 


Leu 


Asp 
200 


Leu 


Asp 


Glu 


Glu 


Gly 
215 


Gin 


Arg 


Asp 


Arg 


Val 
230 


Val 


Cys 


Trp 


Gin 


Pro 
245 



<210> 46 
<211> 316 
<212> PRT 



Val 


Tyr 


Ala 


Ala 


Thr 

115 

X X —J 


He 


Asn 


Lys 


Cys 


Val 
13 0 


Pro 


Ser 


Asn 


Glu 


His 
145 


Thr 


Val 


Cys 


Met 


Arg 
160 


Arg 


Arg 


His 


His 


Cys 
175 


Cys 


Ser 


Glu 


Lys 


Arg 
190 


Val 


Arg 


He 


Cys 


Asp 
205 


Met 


Ala 


Thr 


Cys 


Gin 
220 


Leu 


Lys 


Ser 


Pro 


Leu 
235 


Asp 


Ser 


Ser 


Asp 





2790762CD1 



Ser 


Pro 


Ser 


Pro 


Leu 










10 


Glu 


Ser 


Arg 


Ala 


Arg 










25 


Pro 


Pro 


Pro 


Pro 


Leu 










40 


Val 


Gly 


Ser Gly 


Gly 










55 


Pro 


Phe 


Asn 


Phe 


Tyr 










70 


Ser 


Lys 


Gly Arg 


Leu 










85 


Pro 


Arg 


Thr 


Ala 


Arg 










100 


Asp 


Leu 


Lys 


Leu 


Leu 










115 


Glu 


Ser 


Cys 


Arg 


Ser 










130 


Glu 


Arg 


Asp 


Val 


Arg 










145 


Ser 


Gly Asp 


Val 


His 










160 


Gly 


Val 


Gly 


He 


Gly 










175 


Asn 


Glu 


Leu 


Asp 


Gin 










190 


Glu 


Leu 


Glu 


Asp 


Phe 










205 


Ala 


Gly Gly Phe 


Thr 










220 


Asp 


Glu 


Glu 


Ala 


Leu 










235 


Leu 


Leu 









Ala 


Thr 


Glu 


Lys 


Ser 








X ^» \J 


Thr 


Asp 


Leu 


Leu 


Ser 








135 


Ala 

A1CI 


Ala 


Val 
vox 




Val 

V AX 










150 


Cvs 


Gin 


Lys 


Ala 


Lys 










165 


Arg 


Lys 


Cys 


Glv 


Phe 










180 


Phe 


Leu 


Leu 


Pro 


Ser 










195 


Phe 


Cys 


Tyr 


Asp 


Leu 










210 


Pro 


Ala 


Arg 


Ser 


Asp 










225 


Asn 


Asp 


Met 


Ser 


Asp 










240 


IT X w 






XT X <J 


Ala 










X Zj 


i L \J 


x lie 


war xxi 


Ala 

nld 


Xtcr u 










3 0 




X-LIX 


Ser 




uiy 










*± — > 


7\ en 


vj xjf 


VJlli 














60 


Gly 


Ala 


Ser 


Leu 


Leu 












Leu 


He 


Asp 


Pro 


Asn 










His 


Ala 


Pro 


Ala 


Val 










1 0^ 


Lys 


Asp 


Val 


Lys 


He 










120 


Lys 


Asp 


Arg 


Lys 


Val 










13 5 


Ala 


Glu 


Cys 


Glv 


Leu 










150 


Ala 


Cys 


Pro 


Phe 


Gly 










165 


Gly 


Glu 


Ser 


Ala 


Asp 










180 


Glu 


Lys 


Arg 


Val 


Glu 










195 


Thr 


Asp 


Asn 


Leu 


Glu 










210 


Ala 


Lys 


Ala 


He 


Val 








225 


Asn 


Phe 


Pro 


Tyr 


Glu 



240 



45/82 
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<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 2869164CD1 



<400> 46 

Met Ala Glu Ala Ala 
1 5 
Phe Pro Ala Ala Ala 
20 

Tyr Leu Asp Lys Pro 
35 

Val Cys Pro Asn Arg 
50 

Trp Pro Ala Leu Gin 
65 

Val Gly Ser Thr Glu 
80 

Ala Asp Ala Val Arg 
95 

Arg Leu Pro Leu Ser 
110 

Gin His Pro Gly Val 
125 

Pro Ser Glu Leu Pro 
140 

Pro Trp Ala Ser Glu 
155 

Phe Trp Leu Gly Glu 
170 

His Tyr Glu Asn Leu 
185 

Leu Phe His Pro Pro 
200 

Tyr Thr Pro Ala Thr 
215 

Val Val Asp Glu Glu 
230 

Asp Pro Leu Ala Pro 
245 

Ala Gin Ala Leu Arg 
260 

Leu Pro Ala Leu Trp 
275 

He Ala Val Asn Phe 
290 

Ser Tyr Phe Gin Leu 
305 

Asp 



Leu Glu Ala Val Arg 
10 

Arg Glu Leu Cys Val 
25 

Pro Thr Pro Leu His 
40 

Pro Cys He He Arg 
55 

Lys Trp Ser Leu Pro 
70 

Val Ser Val Ala Val 
85 

Gly Asp Arg Phe Met 
100 

Phe Val Leu Asp Val 
115 

Leu Tyr Val Gin Lys 
13 0 

Gin Leu Leu Pro Asp 
145 

Ala Leu Gly Lys Met 
160 

Ala Ala Ala Val Thr 
175 

Tyr Cys Val Val Ser 
190 

Ser Asp Arg Pro Phe 
2 05 

Tyr Gin Leu Thr Glu 
220 

Ala Met Glu Lys Val 
235 

Asp Leu Ala Arg Tyr 
250 

Cys Thr Val Arg Ala 
265 

Phe His His Val Gin 
280 

Trp Tyr Asp Met Glu 
295 

Leu Asp Ser Leu Thr 
310 



Ser Glu Leu Arg Glu 
15 

Pro Leu Ala Val Pro 
30 

Phe Tyr Arg Asp Trp 
45 

Asn Ala Leu Gin His 

60 

Tyr Phe Arg Ala Thr 
75 

Thr Pro Asp Gly Tyr 

90 

Met Pro Ala Glu Arg 
105 

Leu Glu Gly Arg Ala 
120 

Gin Cys Ser Asn Leu 
13 5 

Leu Glu Ser His Val 
150 

Pro Asp Ala Val Asn 
165 

Ser Leu His Lys Asp 
180 

Gly Glu Lys His Phe 
195 

He Pro Tyr Glu Leu 
210 

Glu Gly Thr Phe Lys 
225 

Pro Trp He Pro Leu 
240 

Pro Ser Tyr Ser Gin 
255 

Gly Glu Met Leu Tyr 
270 

Gin Ser Gin Gly Cys 
285 

Tyr Asp Leu Lys Tyr 
3 00 

Lys Ala Ser Gly Leu 
315 



<210> 47 

<211> 334 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3317629CD1 



<400> 47 

Met Thr Arg Ser Leu Phe Lys Gly Asn Phe Trp Ser Ala Asp He 

1 5 10 15 

Leu Ser Thr He Gly Tyr Asp Asn He He Gin His Leu Asn Asn 

20 25 30 

Gly Arg Lys Asn Cys Lys Glu Phe Glu Asp Phe Leu Lys Glu Arg 

35 40 45 
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Ala Ala 


He 


GlU 


Glu 






Gly 








50 








Arg Lys 


Lys 


Pro 


Cys 


Gly 


Gin 


Ser 






65 








Ala Leu 


GlU 


Val 


Phe 


Lys 


Gin 


Gin 








80 








His lie 


Gin 


Leu 


Ala 


Gin 


Ser 


Leu 








95 








Glu Glu 


Phe Arg 


Glu 


Lys 


Gin 


Lys 








110 








Leu lie 


Met 


Asp 


Ala 


He 


His 


Lys 








125 








Lys Thr 


Met 


Asp 


Ala 


Lys 


Lys 


Asn 






140 








Lys Asp 


Glu 


Ala 


Glu 


Gin 


Ala 


Val 






155 








Asn Pro 


Lys 


Gin 


Gin 


Glu 


Lys 


Leu 






170 








Lys Thr 


Ala 


Val 


Glu 


Asp 


Ser 


Asp 






185 








Gly Thr 


Leu 


Asp 


Lys 


Val 


Arg 


Glu 






200 








Lys Ala 


Cys 


GlU 


Ala 


Phe 


Glu 


Ala 






215 








Phe Phe 


Arg 


Asn 


Ala 


Leu 


Trp 


Leu 








230 








Gin Cys 


Val 


Thr 


Ser 


Asp 


Glu 


Met 






245 








Leu Glu 


Met 


Cys 


Ser 


He 


Gin 


Arg 






260 








Gin Arg 


Lys 


Thr 


Gly 


Gin 


He 


Pro 




275 








Asn Phe 


Tyr 


Ser 


Ser 


Gin 


Lys 


Asn 






~i a n 








Thr Gly 


Pro 


Asn 


Leu 


Ala 


Arg 


Arg 






305 








Ser Ser 


Pro 


Asp 


Asp 


Pro 


Asn 


Tyr 








320 








Leu Leu 


Tyr 


Gin 










<210> 48 












<211> 113 












<212> PRT 












<213> Homo sapiens 








<220> 














<221> misc_f eature 








<223> Incyte ID 


No: 


3870488CD1 


<400> 48 












Met Asp 


Pro 


Lys 


Leu 


Leu 


Lys 


Gin 


1 






5 








Glu Arg 


Glu 


Phe 


Arg 


Lys 


Lys 


Phe 








20 








Val His 


Thr 


Lys 


Met 


Met 


He 


Asp 








35 








Gly Gly 


Gly 


Lys 


His 


Leu 


Gly 


He 








50 








Val He 


Glu 


Phe 


Thr 


Ser 


Asn 


Glu 








65 








Lys Gly 


Lys 


Tyr 


Gly 


Tyr 


Val 


Pro 








80 








Glu Thr 


Glu 


Val 


Tyr 


Asp 


Asp 


Val 








95 








Asn Gin 


Pro 


Leu 


Pro 


Leu 


Gly 


Arg 








110 








<210> 49 













Lys 


Asp 


Leu 


Leu 


Asn 


Leu 


ser 


55 










60 


Glu 


He 


Asn 


Thr 


Leu Lys 


Arg 




70 










75 


Val 


Asp 


Asn 


Val 


Ala 


Gin 


Cys 




85 










90 


Arg 


Glu 


Glu Ala Arg 


Lys 


Met 


100 










105 


Leu 


Gin 


Arg 


Lys 


Lys 


Thr 


GlU 




115 










120 


Gin 


Lys 


Ser 


Leu 


Gin 


Phe 


Lys 




130 










135 


Tyr 


Glu 


Gin 


Lys 


Cys 


Arg 


Asp 


145 










150 


Ser 


Arg 


Ser 


Ala 


Asn 


Leu 


Val 




160 










165 


Phe 


Val 


Lys 


Leu 


Ala 


Thr 


Ser 




175 










180 


Lys 


Ala 


Tyr 


Met 


Leu 


His 


He 




190 










195 


Glu 


Trp 


Gin 


Ser 


Glu 


His 


He 




205 










210 


Gin 


Glu 


Cys 


Glu 


Arg 


He 


Asn 




220 










225 


His 


Val 


Asn 


Gin 


Leu 


Ser 


Gin 




235 










240 


Tyr 


Glu 


Gin 


Val 


Arg 


Lys 


Ser 


250 










255 


Asp 


He 


Glu 


Tyr 


Phe 


Val 


Asn 


265 










270 


Pro 


Ala 


Pro 


He 


Met 


Tyr 


Glu 




280 










285 


Ala 


Val 


Pro Ala Gly Lys 


Ala 




295 










300 


Gly 


Pro 


Leu 


Pro 


He 


Pro 


Lys 


310 










315 


Ser 


Leu 


Val 


Asp 


Asp 


Tyr 


Ser 




325 










330 


Leu 


Arg 


Lys 


Ala 


Glu 


Lys 


Ala 




10 










15 


Lys 


Phe 


Glu Gly 


Glu 


He 


Val 




25 










30 


Pro 


Asn 


Ala 


Lys 


Thr 


Arg 


Arg 




40 










45 


Arg 


Arg 


Gly 


Glu 


He 


Leu 


Glu 




55 










60 


Glu 


Met 


Leu 


Cys 


Arg 


Asp 


Pro 




70 










75 


Arg 


Thr 


Ala 


Leu 


Leu 


Pro 


Leu 


85 










90 


Asp 


Phe 


Cys 


Asp 


Pro 


Leu 


Glu 




100 










105 
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<211> 264 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 38863 18CD1 

<400> 49 



Met 


Leu 


Gly Ala 


Glu 


Thr 


Glu 


Glu 


1 








5 








Ser 


He 


Ser 


Lys 


Arg 


Glu 


Gin 


Leu 








20 








Tyr 


Phe 


Tyr 


Val 


Pro 


Asp 


Leu 


Gly 








35 








Pro 


Ser 


Tyr 


Leu 


Pro 


Asp 


Leu 


Pro 








50 








Tvr 


He 


Ala Asp 


Leu 


Gly 


Pro 


Gly 








65 








Thr 


He 


Pro 


Glu 


Leu 
80 


Pro 


Thr 


Phe 


Leu 


Lvs 


Ala 


Asp 


Leu 
95 


Gin 


Asp 


Gly 


Pro 


Pro 


Pro 


Pro 


Pro 
110 


Pro 


Ala 


Pro 


Pro 


Leu 


Pro 


Pro 


Ser 
125 


Thr 


Ala 


Ala 


Gin 


Asp 


Asp 


Ser 


Ser 
140 


Ser 


Ser 


Ala 


Pro 


Arg 


Glu 


Val 


Val 


Asp 


Pro 


Ser 








155 








GlU 


Ser 


He 


Arg 


Gin 
170 


Ala 


Gly 


Gly 


Ser 


Met 


Lys 


Glu 


Arg 
185 


Lys 


Leu 


Glu 


Glu 


Gin 


Val 


Arg 


Ala 
200 


Thr 


Ser 


Gin 


Leu 


Phe 


Asn 


Lys 


Leu 
215 


Val 


Met 


Arg 


Gly 


Pro 


Gly Ala 


Gly 


Glu 


Gly 


Pro 








230 








Ser 


Asp 


Ser 


lie 


Pro 


Pro 


Leu 


Pro 








245 








GlU 


Glu 


Asp 


Glu 


Asp 


Asp 


Trp 


Glu 



260 
<210> 50 
<211> 185 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4043 9 34CD1 

<400> 50 



Met 
1 


Gly Gin 


Cys 


Leu 
5 


Arg 


Tyr 


Gin 


Glu 


Tyr Gin 


Ala 


Leu 


Thr 


Phe 


Leu 






20 








He 


His Asp 


Thr 


Phe 


Leu 


Lys 


Leu 






35 








Lys 


Glu Ala 


Thr 


Leu 


Thr 


Met 


Asp 






50 








Leu Arg Val 


Asn 


Pro 


Phe 


Arg 


Asp 








65 








His 


Lys Gly 


Met 


Phe 


Ser 


Phe 


Glu 



80 



Lys 


Leu 


Phe Asp 


Ala 


Pro 


Leu 


10 










15 


Glu 


Gin 


Gin 


Val 


Pro 


Glu 


Asn 




25 










30 


Gin 


Val 


Pro 


Glu 


He Asp 


Val 




40 










45 


Gly 


He 


Ala 


Asn 


Asp 


Leu 


Met 


55 










60 


He 


Ala 


Pro 


Ser 


Ala 


Pro 


Gly 




70 










75 


His 


Thr 


Glu 


Val 


Ala 


Glu 


Pro 




85 










90 


Val 


Leu 


Thr 


Pro 


Pro 


Pro 


Pro 




100 










105 


Glu 


Val 


Leu 


Ala 


Ser 


Ala 


Pro 




115 










12 0 


Pro 


Val 


Gly Gin 


Gly Ala 


Arg 




13 0 










135 


Ser 


Pro 


Ser 


Val 


Gin Gly Ala 




145 










150 


Gly Gly Arg Ala Thr 


Leu 


Leu 




160 










165 


He 


Gly 


Lys 


Ala 


Lys 


Leu 


Arg 




175 










180 


Lys 


Lys 


Gin 


Gin 


Lys 


Glu 


Gin 




190 










195 


Gly 


Gly 


His 


Leu 


Met 


Ser 


Asp 


205 










210 


Arg 


Lys 


Gly 


He 


Ser 


Gly 


Lys 




220 










225 


Gly 


Gly 


Ala 


Phe 


Ala 


Arg 


Val 




235 










240 


Pro 


Pro 


Gin 


Gin 


Pro 


Gin 


Ala 




250 










255 



Ser 



Met 


His 


Trp 


Glu Asp 


Leu 


Glu 




10 








15 


Thr 


Arg 


Asn 


Glu He 


Leu 


Cys 




25 








30 


Cys 


Pro 


Pro 


Gly Lys 


Tyr 


Tyr 


40 








45 


Gin 


Val 


Ser 


Ser Leu 


Pro 


Ala 




55 








60 


Arg 


He 


Cys 


Arg Val 


Phe 


Ser 




70 








75 


Asp 


Val 


Leu Gly Met 


Ala 


Ser 


85 








90 
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vai pne ser 


G1U 


cm 


Ala 


Cys 


Pro 


Ser 


T nu 

Lieu 


Lys 


He 


Glu 


Tyr 


Ala 






Q Q 










1 on 










1 Ub 


Prie Arg He 


Tyr 


Asp 


irne 


Asn 


bill 


Asn 


c*±y 




He Asp 


U1U 


pi n 

v»lU 






iiU 










lie 










1 O A 
1Z U 


Asp lieu v»in 


Arg 


lie 


lie 


Leu 


Arg 


Leu 


Leu 


Asn 


oGi 


Asp 


Asp 


nei. 






IOC 

1/ D 










iJ U 










1 

ijj 




T r>i 1 


JJCU, 


Mot" 


Asp 


Leu 


lill 


Asn 


nib 


Vai 


Leu 




ulU 






±*k VJ 










1 *i O 










IDU 


ber nsp jjcu 


Asp 


Asn 


Asp 


Asn 


rlG L- 


Leu 




r^iie 


Ser 


Glu 


rne 


bill 






ijj 










1DU 










1 D J 


U-j c Ala Mot- 

riis Ala neu 




Lys 


ser 


Fro 


Asp 




rlcL 


iyr 




riie 


Arg 


T 1 0 

lie 






170 










175 










180 


Arg Phe Trp 


Gly 


Cys 


























185 






















<210> 51 


























<211> 72 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_f eature 






















<223> Incyte ID 


No: 


4371445CD1 
















<400> 51 


























Met Phe Thr 


He 


He 


Phe 


Pro 


Val 


Cys 


Lys 


Asn 


Ser 


Met 


Pro 


vai 


1 




5 








XU 












Lys Lys Thr 


Asp 


Thr 


Asp 


Arg 


Ala 


Leu 


Ser 


Leu 


Leu 


Glu 


vjilU 


Tyr 






20 










ZD 










O U 


Cys Lys Lys 


Leu 


Arg 


Lys 


Pro 


Glu 


G1U 


Gin 


Lieu 


Leu 


Lys 


Asn 


Ala 






35 










4U 












Val Lys Lys 


Val 


Met 


Gly 


He 


Phe 


jjys 


ser 


Ser 


Leu 


Pile 


pi- 
Gill 


Ala 






50 










55 










60 


Leu Leu Gly Met 


Tyr 


Tyr 


Glu 


Ser 


Tyr 


Ser 


Ser 


Phe 












65 










/ U 












<210> 52 


























<211> 434 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> mis c_f eature 






















<223> Incyte ID 


No: 


5527925CD1 
















<400> 52 


























Met Ala Ala 


Ala 


Ala 


Gly 


Ser 


Cys 


Aid 


Arg 


vai 


A±a 


Aid 


Trp 


biy 


1 




5 










1 A 

xu 












Gly Lys Leu 


Arg 


Arg 


Gly 


Leu 


Ala 


vai 


Ser 


Arg 


bin 


Ala 


vai 


Arg 






20 










Z D 










J u 


Ser Pro Gly 


Pro 


Leu 


Ala 


Ala 


Ala 


V d X 


nld 


oiy 


Ala. 


Ala 


Leu 


Ala 

nid 






35 










*± u 










Gly Ala Gly 


Ala 


Ala 


Trp 


His 


His 


Ser 


Arg 


vai 


ser 


Val 


Ala 


Ala. 






50 










jj 










O U 


Arg Asp Gly 


Ser 


Phe 


Thr 


Val 


Ser 


Aid 


oin 


Lys 


Asn 


Val 


Glu 


nib 






65 










/ u 












Gly He He 


Tyr 


He 


Gly 


Lys 


Pro 




Leu 


A T*fT 


Lys 


Gin Arg 


rile 






80 










O D 












Met Gin Phe 


Ser 


Ser 


Leu 


Glu 


His 


V3i LI 




CI 1 n 

olu 


Tyr 


Tyr 


Met 


TVi r* 
nil 






95 










i on 

1 u u 






1 f)R 


Pro Arg Asp 


Phe 


Leu 


Phe 


Ser 


Val 




r lie 


IjIU 


Gin 


Met 


Glu 


£\jl y 






110 










lie; 

113 










1Z u 


Lys Thr Ser 


Val 


Lys 


Lys 


Leu 


Thr 


Lys 


Lys 


Asp 


He Glu Asp 


Thr 






125 










130 










135 


Leu Ser Gly 


He 


Gin 


Thr 


Ala 


Gly 


Cys 


Gly 


Ser 


Thr 


Phe 


Phe 


Arg 






140 










145 










150 


Asp Leu Gly 


Asp 


Lys 


Gly 


Leu 


He 


Ser 


Tyr 


Thr 


Glu 


Tyr 


Leu 


Phe 






155 










160 










165 


Leu Leu Thr 


He 


Leu 


Thr 


Lys 


Pro 


His 


Ser 


Gly 


Phe 


His 


Val 


Ala 



49/82 
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170 175 ^80 



Phe 


Lys 


Met 


Leu 


Asp 


Thr 


Asp 


Gly Asn 


Glu 


Met 


He 


Glu 


Lys 


Arg 








185 










190 










195 


Glu 


Phe 


Phe 


Lys 


Leu 


Gin 


Lys 


He 


He 


Ser 


Lys 


Gin 


Asp 


Asp 


Leu 








200 










205 










210 


Met 


Thr 


Val 


Lys 


Thr 


Asn 


Glu 


Thr 


Gly Tyr 


Gin 


Glu 


Ala 


He 


Val 








215 










220 










225 


Lys 


Glu 


Pro 


Glu 


He 


Asn 


Thr 


Thr 


Leu 


Gin 


Met Arg 


Phe 


Phe 


Gly 








230 










235 










240 


Lys 


Arg 


Gly 


Gin 


Arg 


Lys 


Leu 


His 


Tyr 


Lys 


Glu 


Phe 


Arg 


Arg 


Phe 




245 










250 










255 


Met 


Glu 


Asn 


Leu 


Gin 


Thr 


Glu 


He 


Gin 


Glu 


Met 


Glu 


Phe 


Leu 


Gin 










260 










265 










270 


Phe 


Ser 


Lys 


Gly 


Leu 


Ser 


Phe 


Met 


Arg 


Lys 


Glu Asp 


Phe 


Ala 


Glu 








275 










280 










285 


Trp 


Leu 


Leu 


Phe 


Phe 


Thr 


Asn 


Thr 


Glu 


Asn 


Lys 


Asp 


He 


Tyr 


Trp 








290 










295 










300 


Lys 


Asn 


Val 


Arg 


Glu 


Lys 


Leu 


Ser Ala Gly 


Glu 


Ser 


He 


Ser 


Leu 








305 










310 










315 


Asp 


Glu 


Phe 


Lys 


Ser 


Phe 


Cys 


His 


Phe 


Thr 


Thr 


His 


Leu 


Glu 


Asp 






320 










325 










330 


Phe 


Ala 


He 


Ala 


Met 


Gin 


Met 


Phe 


Ser 


Leu 


Ala 


His 


Arg 


Pro 


Val 










335 










340 










345 


Arg 


Leu 


Ala 


Glu 


Phe 


Lys 


Arg 


Ala 


Val 


Lys 


Val 


Ala 


Thr 


Gly 


Gin 








350 










355 










360 


Glu 


Leu 


Ser 


Asn 


Asn 


He 


Leu 


Asp 


Thr 


Val 


Phe 


Lys 


He 


Phe 


Asp 










365 








370 










375 


Leu 


Asp 


Gly 


Asp 


Glu 


Cys 


Leu 


Ser 


His 


Glu 


Glu 


Phe 


Leu 


Gly 


Val 






380 










385 










390 


Leu 


Lys 


Asn 


Arg 


Met 


His 


Arg 


Gly 


Leu 


Trp 


val 


Pro 


Gin 


His 


Gin 








395 










400 










405 


Ser 


He 


Gin 


Glu 


Tyr 


Trp 


Lys 


Cys 


Val 


Lys 


Lys 


Glu 


Ser 


He 


Lys 










410 










415 










420 


Gly 


Val 


Lys 


GlU 


Val 


Trp 


Lys 


Gin 


Ala 


Gly 


Lys 


Gly 


Leu 


Phe 





425 430 
<210> 53 
<211> 1629 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 129042CB1 

<400> 53 

gcgacctgta tgaggaggag gaggaggagg atgtgaagat ggcggacgtg cagatgctgc 60 
tggaagagga aatcccgggg ggccgccggg ccctcttcga cagctacaca aatctggaac 12 0 
gggtggccga ttactgcgag aacaactaca tacagtcagc agataagcag agagccctag 180 
aagaaaccaa agcctacacc acccaatcct tagcaagtgt tgcctatctg ataaacacct 240 
tggccaacaa tgtcctgcag atgctggata tccaggcatc ccagctacga aggatggaat 3 00 
cttcaatcaa tcatatttca caaacagttg atattcataa agagaaagtt gcaagaagag 3 60 
aaattggtat tttgactacc aataaaaaca cttcaaggac acataagatt attgctccag 420 
ccaaccttga acgaccagtt cgttatatta gaaaacctat tgactataca attctagatg 480 
atattggaca tggagtaaag gtgagtaccc agaacatgaa gatgggtggg ctgccgcgta 540 
caacacctcc aactcagaag ccccctagtc cccctatgtc agggaaaggg acacttgggc 600 
ggcactcccc ctatcgcaca ctggagccag tgcgtcctcc agtggtacca aatgattacg 660 
tacctagccc aacccgtaat atggctccct cgcagcagag ccctgtgagg acagcttctg 720 
tgaatcaaag aaatcgaact tacagcagca gtgggagtag tggagggagc cacccaagta 78 0 
gtcggagcag cagtcgagag aacagtggaa gtggtagtgt gggggttcct attgctgttc 840 
ctactccatc tcctcccagt gtctttccag gtcatcctgt acagttctac agcatgaata 900 
ggcctgcctc tcgccatact cccccaacaa tagggggctc gttgccctat agacgccctc 960 
cttccattac ttcacaaaca agccttcaga atcagatgaa tggaggacct ttttatagcc 1020 
agaatccagt ttcagataca ccacctccac cgccacctgt ggaagaacca gtctttgatg 1080 
agtctccccc acctcctcct cctccagaag attacgaaga ggaggaagct gctgtggttg 1140 
agtatagtga tccttatgct gaagaggacc caccgtgggc tccacgttct tacttggaaa 1200 
aggttgtggc aatttatgac tatacaaaag acaaggaaga tgagctgtcc tttcaggaag 1260 
gagccattat ttatgtcatc aagaagaatg acgatggttg gtatgaggga gttatgaatg 132 0 



50/82 



WO 00/77040 



gagtgactgg 
gctcagcagg 
ccctggggat 
ttttttttgg 
ctttctgcca 
atgtatgtc 



gctttttcct 
gctgtgcttg 
tccactccag 
tttattcccc 
tcatttgtac 



^^lattacg 

cctcacagga 
taaagtagaa 
agtattaaaa 
aatgctgagc 



ttgagtctat 
atagtcaggt 
tgaaggatac 
acaaagcaag 
tgtctggatt 



catgcattat 
cttcccagat 
aaatgataaa 
ctgagtctga 
gaaataaaat 



PCT/US00/16636 



tgagtaaa 1380 
tatctgaagg 1440 
aattacactt 1500 
acaaatggat 1560 
gaccattttt 1620 
1629 



<210> 54 

<211> 1257 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 778003CB1 



<400> 54 

gcacctgatc 

tggggaaagg 

tgacaggcgc 

cccctccgca 

cgcagaagaa 

accccgcggg 

accggctcat 

ccggggagga 

ctgcaccccc 

ttcccggcag 

cagatggacc 

ggccagcaga 

ttgcagtgca 

ggagacctcc 

agaaacagcc 

tctgcaagga 

tgtctctcag 

aggtggtgct 

acagttcacg 

tcacgcagac 

tatcccaaag 



tttctcatcc 
ggcccggaga 
ccaaatggcc 
gccgcccacc 
cctggacttt 
ccctggggac 
caaggtggag 
gctggaagcc 
ggatgatggg 
aggggtgcag 
cccttctggg 
tgagtatgat 
gtttgacagt 
gcccagaagc 
gtggtttcat 
aggcagctac 
gagcagccag 
gggccaacac 
cccactgccg 
cccctgacag 
ccctcccatg 



ttccctgc tc 
agggcatgtg 
aagtggctac 
ccggactaca 
gaggacccct 
tccaagaacc 
gctgcggata 
gacactgagt 
tacatggagc 
ctctatgaca 
cagaagcctc 
cagccctggg 
ccagagtggg 
ccccagcctg 
ggccccctga 
ctagtgcggc 
ggcttcctgc 
agcgggccct 
gtgcagggtg 
tgaccctcgg 
gcctagaaaa 



ttcccttcct 
ggggcccctc 
gggactacct 
ccgagagcga 
atgaggacgc 
cc ggagatgc 
tggccagagc 
atttagaccc 
cctacgatgc 
ccccttatga 
ggcagagccg 
agtggaagaa 
agaggactcc 
cggagcgtgt 
acagggcgga 
tcagtgagac 
atctgaagtt 
tccccagcgt 
ccgagcatct 
cccccttttg 
taaataagtt 



ctccacctcc 
tgacagtggc 
gagctttggg 
catcctgagg 
ggagagccgc 
caagtatggt 
caaggccctt 
ctttgatgct 
ccaatgggtc 
ggaacaggac 
gatgccccag 
agaccacatc 
aggctcagcc 
ggacccagcc 
tgcagagagc 
cagcccccag 
cgcgcggacc 
gcccgagctc 
ggctctgctg 
agtcctcggg 
attgttaaaa 



tcctcctcct 
ccgattgggg 
ggtcggaggc 
gcctaccgcg 
ttggagccgg 
tctcccaagc 
ctgggcggcc 
cagcctcatc 
atgagtgaac 
ccagagacag 
gaagatgaac 
tccagggcgt 
aaggagctcc 
ctgcccctgg 
ctcctgtccc 
gactgctcct 
cgtgagaacc 
gtcctccact 
taccccgtgg 
cccagaatcg 
aaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1257 



<210> 55 

<211> 1527 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1418671CB1 



<400> 55 

gcttcctggg cgccgtgggc 
cgcggggaag gcgcacctgg 
gctcaaggag ggcttcctgg 
gttcatcctt cggcagaaca 
ccctcccaag ggccggatcc 
tgaaaaccga ccgctcctca 
ggcctgttct cgagaggagc 
agggcagccg gggaaggtcc 
gcacatcagc ctgcatcgca 
aagccccaac atggagcagg 
ggactggctc atctccaaca 
catgctcatg gaggagaact 
ctctggggat ctggccgagc 
gagctacaaa aagaagataa 
tggcacggtg gtgaaacaag 
ggtgcgtcgc tttgttctaa 
agaagagaac aggccagtgg 
agataatggc gttcccactg 



gcggactgcg cgggctgcgc 
ggtggccctg gcgtgcgggc 
tcaagagggg ccacattgtc 
cgctggtgta ctacaagctt 
tcctggatgg ctgcaccatc 
ttaagctgaa gactcaaaca 
gggatgcctg ggcctttgag 
agcagctgca cagcctgaga 
ttgtggacaa gatgcacgat 
gaagcaccta taaaaagacc 
gcttcacggc cagccgtctg 
tcctcaggcc tgtgggtgtc 
agttcctgga tgactccaca 
gccccaagga agaaattagc 
gctacctggc caagcaggga 
ggaaggat cc agctttcctg 
g tgggt 1 1 1 c t c t tcgt ggt 
gggttaaagg gaatgtccag 



gggtgccgag gagcgcgagg 60 
ggcgacatgg aggacggcgt 120 
cacaactgga aggcgcgatg 180 
gaggggggtc ggagagtgac 240 
acctgcccct gcctggagta 3 00 
tccacggagt acttcctgga 3 60 
atcaccgggg ctattcatgc 42 0 
aactccttca agctgccccc 480 
agcaacaccg gaatccgttc 540 
ttcctcggct cctccctggt 600 
gaggcggtga ccctggcctc 660 
cgaagcatgg gagccattcg 720 
gccctgtaca cttttgctga 780 
ctgagcactg tggagttaag 840 
cacaagagga aaaactggaa 900 
cattactatg acccttccaa 960 
tcactcgtgt ctgctctgga 1020 
ggaaacctct tcaaagtgat 1080 
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tactaaggat gacacacactlRacattca ggccagcagc aaggctgagc gagccgagtg 1140 

gattgaagct atcaaaaagc taacatgaca aggacctgag ggaaccagga ttcctccctc 1200 

ctaccagatg acacagacaa gagttcctgg agaatgggag tgttaagact tttgacttct 1260 

ttgtaagttt tgtactgctt tggagagtga atgctgccaa gagttcctca gattacaaac 1320 

agcagtggtg ccatttcctt ccccatcttc atgttacaaa cctggaaagg ctagaacagc 1380 

cattaggcgt cagcatcttg acttttcccc agcatcacaa acagccattt cctcgggcac 1440 

caaagtaggt tccctttgtt ggaacaatta cactggccat gccataatgt tgaataaaac 1500 

tctcttctta tgaaaaaaaa aaaaaaa 1527 

<210> 56 
<211> 2220 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> incyte ID No: 1456841CB1 

<400> 56 

ctgctgtcct tccaccacca gcaccggacc acctgctcca agaccagcct cctgggggga 60 
ccacgcaccc ggccttcact ggcacccagg gagccgtcct cagcagcgtc aacatgtcaa 12 0 
ggcccagcag cagagccatt tacttgcacc ggaaggagta ctcccagaac ctcacctcag 180 
agcccaccct cctgcagcac agggtggagc acttgatgac atgcaagcag gggagtcaga 240 
gagtccaggg gcccgaggat gccttgcaga agctgttcga gatggatgca cagggccggg 3 00 
tgtggagcca agacttgatc ctgcaggtca gggacggctg gctgcagctg ctggacattg 3 60 
agaccaagga ggagctggac tcttaccgcc tagacagcat ccaggccatg aatgtggcgc 420 
tcaacacatg ttcctacaac tccatcctgt ccatcaccgt gcaggagccg ggcctgccag 480 
gcactagcac tctgctcttc cagtgccagg aagtgggggc agagcgactg aagaccagcc 540 
tgcagaaggc tctggaggaa gagctggagc aaagacctcg acttggaggc cttcagccaa 600 
gccaggacag atggaggggg cctgctatgg aaaggccgct ccctatggag caggcacgct 660 
atctggagcc ggggatccct ccagaacagc cccaccagag gaccctagag cacagcctcc 720 
caccatcccc aaggcccctg ccacgccaca ccagtgcccg agaaccaagt gcctttactc 78 0 
tgcctcctcc aaggcggtcc tcttcccccg aggacccaga gagggacgag gaagtgctga 840 
accatgtcct aagggacatt gage tgt tea tgggaaagct ggagaaggee caggcaaaga 900 
ccagcaggaa gaagaaattt gggaaaaaaa acaaggacca gggaggtctc acccaggcac 9 60 
agtacattga ctgcttccag aagatcaagt acagcttcaa cctcctggga aggctggeca 1020 
cctggctgaa ggagacaagt gcccctgagc tegtacacat cctcttcaag tccctgaact 1080 
tcatcctggc caggtgccct gaggctggee tagcagccca agtgatctca cccctcctca 1140 
cccctaaagc tatcaacctg ctacagtcct gtctaagccc acctgagagt aacctttgga 1200 
tggggttggg cccagcctgg accactagcc gggccgactg gaeaggegat gagcccctgc 1260 
cctaccaacc cacattctcg gatgactggc aacttccaga gccctccagc caagcaccct 1320 
taggatacca ggaccctgtt tcccttcggc ggggaagtca taggttaggg agcacctcac 1380 
actttcctca ggagaagaca cacaaccatg accctcagcc tggggacccc aactccaggc 1440 
cctccagccc caaacctgcc cagccagccc tgaaaatgea agtcttgtac gagtttgaag 1500 
ctaggaaccc aegggaactg actgtggtcc agggagagaa gctggaggtt ctggaccaca 1560 
geaageggtg gtggctggtg aagaatgagg egggaeggag eggctacatt ccaagcaaca 1620 
tcctggagcc cctacagccg gggacccctg ggacccaggg ccagtcaccc tetegggtte 1680 
caatgetteg acttagctcg aggectgaag aggtcacaga ctggctgcag gcagagaact 1740 
tctccactgc caeggtgagg acacttgggt ccctgacggg gagecagcta ettegcataa 1800 
gacctgggga gctacagatg ctatgtccac aggaggcccc acgaatcctg tcccggctgg 1860 
aggctgtcag aaggatgctg gggataagee cttaggcacc agcttagaca cctccaagaa 1920 
ccaggccccg ctgatgcaag atggcagatc tgatacccat tagagccccg agaattcctc 1980 
ttctggatcc cagtttgcag caaaccccac accccagctc acacagcaaa aacaatggac 2040 
aggeccagag ggtgaagcaa acagtgtccc ttctggctgt gttggagcct ccccagtaac 2100 
cacctattta ttttacctct ttcccaaacc tggagcattt atgectagge ttgtcaagaa 2160 
tctgttcagt ccctctcctt ctcaataaaa gcatcttcaa gcttgtaaaa aaaaaaaaaa 2220 

<210> 57 
<211> 2895 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2020010CB1 



<400> 57 



52/82 



WO 00/77040 



PCT/US00/16636 



ccaggcccga agccgaggcg gggccgggat gcggcgctga ggcccagcat ggccggcccg 60 
ggccccacct tcccgctgca ccggctcgtc tgggcgaacc ggcatcgcga actggaggcc 120 
gcactgcaca gccaccagca cgacattgaa caggaggacc cccgcgggcg gaccccactg 180 
gagctggccg tgtctctggg aaacctggag tctgtgagag tgctccttcg acacaatgcc 240 
aacgtgggca aagagaaccg ccagggctgg gcagtcctgc aggaggcagt cagcactgga 300 
gaccccgaga tggtgcagct ggtgctccag tatcgggact accagagggc cacgcagagg 360 
ctggcgggca ttccggaact gctcaacaaa cttcgccagg cccccgattt ctacgttgag 420 
atgaagtggg agttcaccag ctgggtgccc cttgtgtcta agatgtgccc aagcgatgtg 480 
taccgcgtgt ggaagcgggg tgagagcctg cgagtagaca ccagtctcct gggcttcgag 540 
cacatgacct ggcagcgggg ccggaggagc ttcatcttca agggccagga ggcaggagcc 600 
ctggtgatgg aagtggacca tgaccggcag gtggtgcatg tggagacact ggggctcact 660 
ctgcaggagc ccgaaacact gctggccgcc atgcggccca gcgaggagca tgtggccagt 720 
cgcctcacct ctcctatcgt ctccacccac ctggacactc gtaatgtggc ctttgagagg 780 
aacaaatgtg gtatctgggg ctggcggtct gagaagatgg aaactgttag cggctacgag 840 
gccaaggtgt acagtgccac caacgtggag ctggtgacac gcacacgcac ggagcacctc 900 
tctgatcagg acaagtcgag gagcaaagcg gggaagactc cattccagtc cttcctgggg 960 
atggcgcagc agcattcctc ccacaccggg gcccccgtgc agcaggcagc cagccccacc 1020 
aaccccacag ccatctcccc tgaggagtac ttcgacccca acttcagcct ggagtcacgg 1080 
aacattggcc gccccatcga gatgtccagc aaagtacaga ggtgaggtct gagagctggc 1140 
tggggacttg cc tcgggaca agggctcttg cagacccctc tctgggcctg tcatagttag 1200 
ggacccactc cctcggctgg cttctctctg gactccactc ctggagggca ggagtcatgt 1260 
ctgccttatc catggtcctc agcccttagc aaggggcttg gcacagaaaa ccggtagttt 132 0 
gtctttgatg aatggatggt gccacccatg tatggttttc tattgaattt catgagtacc 13 80 
tgctggggcc agcgtggcac agtgggaagg gcccccagcg acgtggcctg ggagtggagc 1440 
gtctgccttg gtggaacaga acagttacgc tcttgcctcg tgaggagcct tgagcaggta 1500 
taggcggtaa cagcaggcac agtgcctgca caagcctagc gctctcccca ccatctcagg 1560 
aggattctct tagccaccca acagtcgctg ggattcgaac cctggcagtc ttgccctggg 1620 
gtgtgctgga cataaaagtt tggcaacacg tgagctggtg accagctctg ggctgaggag 1680 
gaaaacgggg ctgtgggcca ggcccagaga gaagcccaca gcctggcacc tggcctcctt 1740 
gtccagacca gatgagctgg tacgaaatcc tcaggagccc cagcctgggc ccagggagga 1800 
gggcagcttg ggccacgtgg ccaggacacc agctcccggg ggaggcgggc agcggcatct 1860 
gagcagagca gggcactcca ggccaggcag aaggtgggta aaggcagctg cccacgaacc 1920 
agagggcagt cctcaatgga agggccacca gccgtgcctc acccatgtcc tgtggtcggc 1980 
tgggcaggtt caaggcaaca ctgtggctga gtgaagagca cccgctctcc ctgggtgacc 2040 
aggtgacccc catcatcgac ctaatggcca tcagcaacgc tcactttgcc aagctgcgcg 2100 
acttcatcac tctgcgcctt ccacctggct tccccgtcaa aattgagatt ccccttttcc 2160 
acgtgctcaa tgcccgcatc accttcagca acctgtgtgg ctgtgatgag cccctgagct 2220 
ccgtgtgggt gccggccccc agctctgctg tcgccgcatc agggaaccct ttcccgtgcg 2280 
aggtggaccc caccgtgttt gaagtgccca acgggtacag cgtgctgggc atggagcgca 2340 
acgagcccct ccgggacgag gacgatgacc tcctgcagtt cgccatccag cagagcctgc 2400 
ttgaagcggg cactgaggcg gagcaggtga ccgtctggga agccctgacc aacacccggc 2460 
ccggtgcccg ccctcctccc caggccacgg tttatgagga acagcttcag ctggagcggg 2520 
ccctccagga aagcctgcag ctgtccacag agcccagggg cccaggatcc cctcccagga 2580 
cacccccagc ccccggtcca cccagctttg aagagcagct gcgcctggcc ctggagttgt 2640 
cttcacggga gcaggaggag cgggagcggc gcgggcagca ggaggaggag gacttacagc 27 00 
ggatcctgca gctgtcactc actgagcact gagccatagc cccgggaggg ctggccaggc 2760 
cactccctgc ccgcttttgt aatttattta tttataaact ctctgctgct gagcttgggg 282 0 
cctggagccc caggaatgag caggcagggg agactgagat ggaaataaag agactgtcgc 2880 
agcaaaaaaa aaaaa 2895 

<210> 58 
<211> 2801 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No; 2149037CB1 



<400> 58 

gcgccgctga gcgctgactg ggtgcgagtg 
ctgaggtggc ccgtggggca gggcagatga 
ccagctctaa gatagcagga taggagactt 
cccaccagca caatgtcagg aagccataca 
ccgagcatat ggccccagga gatcttggcc 
caaccagagt tctactacga tgagtttggt 
cctggctcca gtctgctggc gaactcccct 



gggaagctgc taacccgacc cggattggcg 60 
ttctggacca gatgaagcct gaggagcctt 120 
ctaagattgg agctgcagaa gacttgccag 180 
cctgcctgtg gccctttctc agccctgact 240 
aagtacacgc agaaggaaga gtcagcagag 3 00 
ttccgtgtgt acaaggaaga aggtgatgag 360 
ctgatggagg atgctccaca gaggctgcgg 42 0 



53/82 
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tggcaggccc 
attgccgtct 
catggcatga 
tctgagctgt 
aagcagatcg 
ggtagcatcg 
gagatcggct 
gaggaggacg 
ttcagcacca 
cagtacctgc 
acactgcact 
atctgggacc 
ctgcacctca 
ctatcggata 
ctggccggct 
ctcattgcag 
gttcgccgca 
gacctggagg 
gaagccatcc 
gagctgactc 
tgctcacgca 
gacgagctgg 
tgctgggtgg 
ctggatgagc 
acagacctcg 
aagaagccat 
ggccgggagg 
ttcaggttgg 
cagtctgtga 
ctgatctgcg 
agcctgccca 
gtccagatca 
tgggagctcc 
ctggtgaagc 
aacctcgggc 
agctccagag 
ccccagcaca 
atgctctagg 
tctgtgctca 
ttataaataa 



acctggagtt 
ccctaccccg 
ggccacagct 
cctaccgcga 
agaaggacct 
gggtgccccg 
actgccaggg 
ccttctggat 
ccctgctggg 
ctcgcctgga 
ggttcctcac 
tgtttttcta 
aggaggaaga 
tcccgtcgca 
ccctcaccga 
accagggcca 
ggacccagcg 
cactcaaggc 
tgcgcgtggc 
cagactatag 
gccaccggcg 
gcttccgcaa 
gggagctcaa 
gcagcaaaga 
tgcgagggac 
ccctgcttgg 
tcgagagaga 
atgaagatgg 
acgtgaccca 
tggggctcaa 
ccgtggagaa 
agtgtgagct 
ctgcgaagag 
accacctctt 
ctgcgtctga 
ccctggccgg 
tcccaggtgg 
ccaaaccaca 
ggaagaggga 
actgtgtctg 



PCT/USOO/16636 



eccataac 
ctctgagaag 
gtggatgcgg 
gattgtgaag 
gctccgcacc 
cctgcgcagg 
caccggcatg 
gatgtctgcc 
tgtccagact 
caagctgctc 
ggccttcgcc 
cgagggctcc 
gctgatccag 
gatggaggac 
tgtggccgtg 
gctcctgggg 
gaggaagtcc 
caagaacatc 
acgccacttc 
catggagagc 
ccgagccaag 
gaacgacatc 
cggcctgcga 
gtactccatc 
cctctgcccg 
gggcgcctgc 
ctttgcctcc 
caaagtcctg 
cgatgcagtg 
tgagcaggtg 
gtggtaccag 
ccgagtcctc 
agaggcgcag 
cagctgggat 
ggtggcccag 
ggccgcggga 
tgggagcaga 
gtttgtacca 
aggggatggg 
tctttgagaa 



cacgatgtgg 
ctccgctccc 
ctctctgggg 
aacagctcca 
atgcccagca 
gtgctccggg 
gtggccgcct 
atcatcgagg 
gaccagcggg 
caggagcatg 
agcgtggtgg 
cgggtgctgt 
tcagagaact 
gcggagctgc 
gagactcagc 
gccggcaccc 
accatcactg 
aagcagacgg 
cagtgcacag 
caccagcggg 
gccctgctgg 
atcacaatcg 
ggctggtttc 
gcgggggatg 
gcccttaagg 
cacccctggc 
gtgtattccc 
accccggagg 
catgcacaaa 
ctgcacctgt 
ccctggtcct 
tgctgctttg 
cagcccctga 
gtggacgggt 
gaccccaagc 
tatcaatatc 
gggtaccctg 
aaaaccttgt 
ggtggctagt 
aaaaaaaaaa 



gggatctcac 
tggtgctggc 
ccctgcagaa 
acgatgagac 
acgcctgctt 
ccctggcctg 
gcctcctgct 
acctgctccc 
tcctgcgcca 
acattgagct 
acatcaagct 
tccagctcac 
cggcctccat 
ttctgggggt 
gccgcaagca 
tcaccaacct 
ctctgctctt 
aactggtggc 
accccaaaaa 
accacgagaa 
actttgagcg 
tgtctcagaa 
cagccaagtt 
actcggtgac 
ccctgttcga 
tgtttatcga 
gtctggtgct 
agctgctcta 
tggatgtgaa 
ggctggaggt 
tcctgcgcag 
ccttcagcct 
aggagggcgt 
gaccccctcc 
tgcagagccc 
aggctgcccc 
ccccaccagg 
gaggaggtgg 
aggctcctgg 



ctgggacaag 

cggcatccca 

gaagaggaac 

catcgctgcc 

cgccagcatg 

gctctaccca 

gttcctggag 

cgcctcctac 

cctcattgtc 

gtccctgatc 

gctcctgcgc 

gctgggcatg 

cttcaacacg 

ggccatgcgg 

cctggcctat 

ctctcaggtt 

cggggaggat 

tgacctccgg 

ctgcagcgtg 

ctacgtggcg 

gcacgacgac 

ggacgagcac 

cgtggaagtc 

ggagggggtc 

acatggactg 

ggaggctgca 

ctgtaagacc 

ccgggctgtg 

gctccgctca 

gctctgctcc 

cccgggctgg 

ctcccaggac 

ccgggacatg 

tccccagccc 

agggaagagc 

actccacgtt 

gtccttaggg 

gggagccatg 

cctctttggt 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2801 



<210> 59 

<211> 599 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No : 



2162179CB1 



<400> 59 

aggttatcgt 

acagggaaga 

ttgaccatga 

taggatgctg 

aacccactgg 

aaagaaaata 

cagctaaacg 

agtgtgattt 

atgcgaaagt 

ccagcctggg 



taggcatctc 
gataatagta 
gtcgaataat 
tcctacggaa 
atacattcga 
cagaccaatt 
tgggtttctt 
attacttatc 
tataggggat 
cgacagagca 



ccaggcgacc 
gcagaatttc 
acagtggatg 
ggagagctgc 
ttcgaaaaat 
ccagaagatg 
actaaggacg 
acaatgactt 
actttaaaat 
agactccatc 



ggctccgcag 
acaaaaaaat 
tgagagagat 
atgatctgat 
ttcttccggt 
tccttcttcg 
agctgatcaa 
atgtgaggaa 
cagtcattct 
tcacaaaaaa 



caagatggcg 
caaagaggca 
tggaacaatt 
tgcagaggta 
gatgacagaa 
agcttttgag 
gtatatgact 
ttaataattt 
ggtgaaagtt 
aaaaaaaggg 



gacgagaagg 60 
tttgaagtct 120 
atcaggtcat 180 
gaggaagaag 240 
atactactag 3 00 
gttttagatt 3 60 
gaagaaggta 42 0 
gttaacagtt 480 
atttactgtt 540 
gggggaggg 599 



<210> 60 

<211> 2065 

<212> DNA 

<213> Homo sapiens 
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<220> 
<221> misc_f eature 
<223> Incyte ID No: 22447 06CB1 

<220> 

<221> unsure 

<222> 2060-2061 

<223> a, t, c, g, or other 

<400> 60 

gtaattactg gaaccacaga aaattcacct gcagatcggt gcaagaaaat ccatgctggc 60 

gatgaagtga ttcaagttaa tcatcagact gtgcctctta tacctagaag tcccacaagc 120 

agcgttgcca cgccttccag caccatcagt acacccacca aaagagacag ttctgccctc 180 

caggatctct acattccccc tcctcctgca gaaccatata ttcccaggga tgaaaaagga 240 

aaccttcctt gtgaagacct cagaggacat atggtgggca agccagtgca taagggatct 3 00 

gaatcaccaa attcatttct ggatcaggaa tatcgaaaga gatttaatat tgtcgaagaa 3 60 

gatactgtct tatattgcta tgaatatgaa aaaggaagat caagtagtca aggaagacga 420 

gaaagcaccc caacttatgg caagctacga cctatatcta tgccagtgga atataattgg 480 

gtgggggact atgaagatcc aaataagatg aagagagata gtagaagaga aaactctcta 540 

cttcggtata tgagcaatga aaagattgct caagaagaat acatgtttca gagaaacagc 600 

aaaaaggaca cagggaagaa gtcaaaaaag aagggtgata agagtaatag cccaactcac 660 

tattcattgc tacctagttt acaaatggat gcactgagac aagacatcat gggcactcct 720 

gtgccagaga ccacactata ccatacattt cagcagtcct cactgcagca caaatcaaag 780 

aagaaaaaca aaggtcctat agcaggcaag agcaaaagac gaatttcttg caaagatctt 840 

ggccgtggtg actgtgaggg ctggctttgg aaaaagaaag atgcgaagag ttacttttca 900 

cagaaatgga aaaaatattg gtttgtccta aaggatgcat ccctttattg gtatattaat 9 60 

gaggaggatg aaaaagcaga aggattcatt agcctgcctg aatttaaaat tgatagagcc 1020 

agtgaatgcc gcaaaaaata tgcattcaaa gcctgtcatc ctaaaatcaa aagcttttat 1080 

tttgctgctg aacatcttga tgatatgaac aggtggctta acagaattaa tatgctgact 1140 

gcaggatatg cagaaagaga gaggattaag caggaacaag attactggag tgagagtgac 1200 

aaggaagaag cagatactcc atcaacacca aaacaagata gccctccacc cccatatgat 1260 

acatacccac gacctccctc gatgagttgc gccagtcctt atgtggaagc aaaacatagc 1320 

cgactttcct ccacggagac ttctcagtct cagtcttctc atgaggagtt tcgccaggaa 13 80 

gtaactggga gcagtgcagt gtctcccatt cgcaagacag ccagtcagcg ccgctcctgg 1440 

caggatttaa ttgagacgcc actgacaagt tcaggcttac actatcttca gactctgccc 1500 

ctggaggatt ctgtcttctc tgactccgcg gccatctccc cagagcacag gcggcagtct 1560 

accctgccaa ctcagaaatg ccacctgcag gatcactatg ggccataccc cttagctgag 1620 

agtgagatga tgcaagtgct aaatggaaat gggggcaagc ctcgaaggtt tactctgcct 1680 

cgagatagcg ggttcaacca ttgctgtctg aatgctccag ttagtgcctg tgacccacag 1740 

gatgacgtgc aaccaccaga ggtggaggaa gaggaggacg atgaggagga agcatgggag 1800 

gcagccggtg gaaacatggg agaaaaaagc ctattcactg cgagagtggg tagacctttc 1860 

atgcaaaacg gatccactct gtggcactaa ccattggact acagattata ccaatgttag 192 0 

agattagatt ggaaattggg gtgcggattc ggttaaccta tggtacctgg ctctaggggg 1980 

gccgtccaat tgcctcagta gtgtttcgcg gcccggcgtg tttaaagtgt atgggaactg 204 0 

ggtcccatta gcctgagcan ncccc 2065 

<210> 61 

<211> 2330 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 23168 05CB1 

<400> 61 

gctttgcaga gtgattatca gcacagttcc ctgccctgga taaggaacag ctacagtcgc 60 

tgttaaatgt gcctgaaaag caatttgcaa tctttgcatt aggatttcag atgcatgcca 120 

ggtttccact gattgccaga actcgagatc actacacatg gatccccaaa atcaacatgg 180 

cagtggcagt tcgttagttg tgatccagca gccttctttg gatagccgtc agagattaga 240 

ctatgagaga gagattcagc ctactgctat tttgtcctta gaccagatca aggccataag 300 

aggcagcaat gaatacacag aagggccttc ggtggtgaaa agacctgctc ctcggacagc 3 60 

accaagacaa gaaaagcatg aaaggactca tgaaatcata ccaattaatg tgaataataa 42 0 

ctacgagcac agacacacaa gccacctggg acatgcagta ctcccaagta atgccagggg 480 

ccccattttg agcagatcaa ccagcactgg aagtgcagcc agctctggga gcaacagcag 540 

tgcctcttct gaacagggac tgttaggaag gtcaccacca accagaccag tccctggtca 600 

taggtctgaa agggcaatcc ggacccagcc caagcaactg attgtggatg acttgaaggg 660 
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ttccttgaaa 
gtgtggagaa 
cctttgctct 
cttctaccac 
ttcacaatca 
ttgcttactc 
ctggatccat 
gagctgcccc 
ctgaacttct 
ttagaatttt 
tttttctctt 
cccactttca 
gtgtagcctt 
atccagacta 
ttttttaaaa 
tagggctatc 
acgtaaaaat 
cctcagtttt 
ttttagattc 
ccatcttcac 
tgcttctcac 
tagctgatca 
atggtccttt 
tccttgcctt 
tgaattctga 
attatgcgaa 
tgtttgaata 
atatgttaac 



gaggacctga 
tgcactgctc 
gctgagagca 
tgctccaatg 
cactgctgct 
tgttatcctc 
cgcccagggt 
tcccggggtc 
agctttcaag 
tccctgtttc 
tcctcatgga 
acaagagcct 
tttgttctgc 
cagtgattta 
agcaactgtt 
ttcctagcat 
tttcctttaa 
gttgttgtct 
gccttacaat 
tctagccttc 
caaatttttg 
atctatatgt 
tggccccttg 
aaccacaaat 
ctatttttaa 
tctctattat 
taacaaaata 
tataatgcag 



:agcacaa 
ccaggaccct 
tggtggaata 
acgacgaagg 
ctagatacct 
ctgctaaagg 
gcagatgtaa 
agggtaaacc 
ttgtggctgt 
ccaccttctc 
tgatcttcag 
ctgccatcca 
aagcaacttt 
gagttgtttt 
taattgctta 
aggcccctta 
ccactgccct 
tactctggag 
gtaaatcttc 
gtatttgtga 
ttgtcattga 
gtcacagaac 
gatagttaac 
tgtggtgctt 
gacaaaagtc 
tttatggtat 
tcaatactta 
aaaatatatt 



PCT/US00/16636 



gttcatttgt gaacagtgtg"7jgaagtgcaa 720 
accatcctgt ttggcctgta accggcagtg 780 
tggaacctgc atgtgcttag tcaagggcat 840 
ggattcctat tcagataatc cttgctcctg 900 
gtgtatggga gccatgtctt tatttttacc 960 
atgcctgaag ctgtgcagga ggtgttatga 1020 
gaactccaac actgtctatt gtaagctgga 1080 
atcatgattt ttggaggtgg gttgtacctc 1140 
tttttgtttt tgtttttgtt tttgttttct 1200 
ttcccctgtt gccaaggtct aactcatgga 1260 
caagagtgga ctgggaagct gcacctggct 1320 
cttgagggta ttgagagcca gtgggctttt 1380 
ctaaagttgt gtacatgaac atacacccac 1440 
gattgggtac cgtgggagca gggaaattgg 1500 
aataagctat gtattaaatc tgtctccagt 1560 
agtagcatgg gggatatatt ttttgctata 1620 
ctccttcttt ctccttcaag gttctttccc 1680 
atgccaagtg tattttttct ttctatgtaa 1740 
acattggaga taatattggt tggaccttgc 1800 
aggactcagc caccttcctt cttcacccca 1860 
gggcacttgg ataactcaag ttgatattta 1920 
tatgctgcct aaagtgatct tggctcctta 1980 
agctgagtaa ttctaatctc ttctgtgttt 2040 
tttgtatatt ttatgtataa atcacaaagt 2100 
tgttaaactt ttttattgta aagaatattt 2160 
ttattgcaaa agactgttga aatgtactca 2220 
acggaaaata aggtgacacg aagaaagtac 2280 
aattaatgaa aaaaaaaaaa 233 0 



<210> 62 

<211> 2610 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2320010CB1 



<400> 62 

aagtatgctt 

cctggctgtc 

acagatcagg 

cacttaatca 

cgaatacagc 

gcttcaagac 

atatacatct 

cctcactgag 

tgtgtgctgc 

ggaatgaagc 

ggaaaacctt 

atgtagcact 

caaatagtga 

agagaatccg 

aaatgctaat 

ggaaaacctg 

ctgcgatgtg 

tatttcctaa 

caatgaaaga 

tgctagaaca 

tttccaagtt 

tttatagaac 

gacttttagg 

atggaaagga 

gacttgttaa 

cagacaaggt 

tgaagcctta 

ataatatcag 



gtttcacttt 
aaaacacttc 
agtaatgcag 
cagttctaca 
tactgaagtt 
ccaactttta 
tgcttttctt 
agctttcatc 
attttgctat 
atctgcactt 
tttgaggaca 
aagcggagga 
cagacctatg 
cactgttctt 
tgatctccag 
gcttgatagc 
ttatgtccat 
cggatgttca 
agatgctggg 
atgtgtggat 
gatcgttcca 
tcttcatgga 
cactttcttc 
gtacatctat 
actttatggt 
aaatgccaaa 
ctttgatgac 
cagatttgtt 



agatatatgg 
ggaatagacc 
gcccggcttc 
acaactgaag 
tccctaacag 
aataatgatg 
aaaaatggac 
agtaagtttc 
gaggttttaa 
ttgtatcttt 
catctacaga 
tcaagatttc 
aaggcaactg 
atggccactg 
tatagcttag 
atggccaaga 
gtagcagctc 
gcgttcaaga 
atgatggatg 
ggcttatgga 
atttatgaga 
gcttacacaa 
agagttgcct 
aaagaaccaa 
gaaaagtttg 
gagcttgatc 
aaagaactca 
tttgaggccc 



ggaaaagaaa 
gaaaatcgca 
agcatcttag 
cagacatttt 
tactagacac 
gccataaccc 
aatctgaagt 
cttcagcatt 
agtgctgcac 
tgatgagaaa 
taataattgc 
aggagtcttt 
cctttcccgc 
cccaaatgaa 
ccaagtccta 
ttcatgtaaa 
tagttgcaga 
aaattactcc 
tccattatag 
aggcagaacg 
aacgtcgtga 
aaattctgga 
tttatggcca 
agctcactgg 
gtacggagaa 
caaaatatgc 
cagaaaggaa 
cttacacttt 



catagcaagg 
aaccatgcct 
tagcctagaa 
ccaccaggca 
catatcattt 
attaatgaaa 
gtcgctgaaa 
tttcaaagga 
atcgaagatt 
caactttgag 
tgtaagccaa 
attcattatc 
agaagtcaaa 
ggagcatgag 
tgcaagcacc 
aaatggagat 
gtttcttcat 
caatatagat 
tgaagaggtc 
ttatgaaata 
gtttgagaaa 
agttatgcat 
atcttttttt 
cc tctcagaa 
tgtcaaaata 
tcatatacaa 
gaccgagttt 
atcaggcaaa 



gtgcatgatg 
gctcttcgaa 
agttcattta 
cttcttgccg 
ttcactcagt 
aaagtgtttg 
catgtatttg 
agagtaaaca 
agctcaacca 
tataccaaaa 
ctgatagctg 
aataattttg 
gacttgacca 
aaagaccctg 
ccagagctca 
ttttcagagg 
cgaaaaaaat 
gaagaaggag 
ttgctggagt 
atttctgaga 
cttactcaag 
acaaaaaaga 
gaagaagaag 
atttccttga 
attcaggatt 
gttacttatg 
gaaagaaatc 
aaacagggct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



56/82 



WO 00/77040 



gtatagaaga 
tgaagaagag 
ccactgatga 
tggacatgat 
gtccattagc 
agaaagtgag 
ttgaactaaa 
caaatttcag 
aagacacaat 
gtacatcaag 
atgtacgtga 
atttaaagat 
agcagggtgc 
tgtgtttggt 
gtagttcaat 
aatacaataa 



acagtgcaaa 
gattcctatt 
aataaaagat 
tcagctccaa 
atatgcaaga 
tgagttgaaa 
tgagcggcta 
agacatggta 
gcattctccc 
tgaccgaggt 
caatgagact 
ttgatataca 
ttacatattt 
ttgtactttt 
ttgcttaatt 
aaggtaaacc 



cgccc 
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:cgtacaa 
aac tgtgaac 
aaaactgcag 
cttaaattgc 
gctttcttaa 
gacatgttta 
attaaagaag 
aaagaattat 
tggatgagca 
tatggttccc 
gacctttctc 
tggagtgttt 
gtaaataagc 
ttaggtaaat 
attgcttaaa 
ttaaaacaaa 



tcttgacaac 
agcagattaa 
agctgcaaaa 
agggctgtgt 
atgacagcca 
ggaaatttat 
atcaagttga 
ctgacattat 
acacattaca 
caagatacgc 
aggaatattt 
cttctcgaca 
aacttgaaag 
ctatatgctg 
ataatggtac 



ttcaaactcg 
tttaaaacca 
gctttgctcc 
ttctgtgcag 
agctagcaag 
acaagcatgc 
gtaccatgaa 
ccatgagcag 
tgtattttgt 
tgaagtgtga 
ggagctgtgc 
ccaaaatttt 
tgcctggaaa 
aaaagtagag 
tatgtaaaat 



tttccttacg 
attgatgttg 
tctactgacg 
gtcaatgctg 
tatccaccta 
agcattgcac 
gggctaaagt 
atattacaag 
gcaattagtg 
ggcaatgcag 
aaatgttaaa 
catgtgttcc 
attgcaccac 
ctcaaaaaca 
tgtataatgg 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2610 



<210> 63 

<211> 1035 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2564901CB1 



<400> 63 

agtccacctt 

ttcccacatg 

cctccagttg 

gcactgctac 

caacattaac 

tgatgggaga 

tacggatttt 

ctcagaaacc 

caagaagatg 

gctgattgag 

atacatccgg 

catccagggt 

gtacttccag 

gatcccaaaa 

aaagtccctg 

gagtggcctg 

cgagtcacct 

aaaaaaaaaa 



gcgaccgtat 
cagggcagca 
ctgatgaccc 
aaggtccatg 
agtgtcttgg 
cccatttatg 
gcagagaatg 
ggctttgcgt 
aggaagaagg 
aaggaagggg 
gagacgtacc 
caaagctgcg 
tcgaatgctg 
gtcttcgacc 
cggtccaggc 
atcgccacag 
ctttcgtctt 
aaaaa 



ccgctagcgc 
caaggagaat 
atggcgtgct 
accgcaatgc 
agtccttgta 
cgttggtgaa 
aactggattt 
cttccacaaa 
aagcggagca 
agttcaccct 
ccgacgcggt 
aaacctgtgg 
aaccgcgctg 
ctgagaagga 
agcattagcc 
cccttcttgg 
aatgttcacc 



ggcctgggat 
gggcgtcatg 
agaggaatgg 
caccgtagat 
tattgagata 
tcttgctaca 
gtttagaaag 
catattgaac 
ggtgctgcag 
gcacggccgg 
gaagatctgc 
gatcaggatg 
cccccactgc 
gagggagtct 
atcgtgccct 
aagaaaggcg 
gtccacagct 



gcgcttgggc 
actgatgtcc 
gacgtgaagc 
aagttggagg 
aagagaggag 
acttcaattt 
gctctggaac 
ctggttgatc 
aagtttgttc 
gccatcctgg 
aatatctgtc 
cacttaccct 
aacgactact 
ggtgtcttga 
gctgaggggc 
tctgtgtttc 
ttggaataaa 



tccctgttcg 
accggcgctt 
gcttgcagac 
acttcatcaa 
tcacggaaga 
ccaaaatggc 
tgattattga 
aacttaaagg 
aaaacaagtg 
agatggaaca 
acagcctcct 
gcgtggccaa 
ggccccacga 
aatcgaacaa 
tggctgcctt 
aggttccacg 
ccatcctggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1035 



<210> 64 

<211> 1838 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2615168CB1 



<400> 64 

tgagtggagt 

gaaggtgtct 

agagtcagac 

aggctgcaca 

caagctacca 

tacttgcaag 

cggcctgtgc 

ggaagatggc 

ctgcagcgtg 

atgcgcagag 

acgtgctgac 



tcactcacat 
gcatgtggga 
aactgctggg 
cacccccact 
ggccacaaca 
aactcttgct 
tcctgtgctg 
accgtccctc 
ggggaccctg 
gctactgagg 
caaccggagc 



ggattgaggc 
ctctgtacag 
ttcgtcccta 
ttcctctacc 
acgacaccca 
cacatcagct 
ctgagcaggg 
acccaggaga 
ggcaccctca 
gccattactg 
agagagcaaa 



ccagttcctg 
cccggtcctc 
agagaggtca 
aggccacacc 
cctcacctct 
aagagattgc 
caaccagtag 
tcctcagcca 
ggaccttctt 
gccaaggcgt 
ggcagctcat 



ggagaagaga 
tcccacatct 
tctgactggc 
ggaggcagtg 
ggcacctctg 
acctgctgac 
caccatgtct 
cctgggcctg 
gaacttcagc 
ggaccgcagt 
ctcacgaaac 



t gc t gggcag 6 0 
gggaggggc c 120 
tgttcagcct 180 
ctcacacagg 240 
agcatccacg 3 00 
ctagagattc 3 60 
gtgactggcg 420 
gccagcaaga 480 
g t ggac aagg 540 
gccattgtgg 600 
ttccaggagc 66 0 
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gcacccaaca 
ttgtgatggc 
tgaaggcctc 
cccagctgca 
acatcacatc 
gccgtgacag 
tgcagcgggc 
atcctgaaca 
aggaggctgt 
cggtgatcaa 
ctgagcccaa 
tgagtatcag 
atgcagtgaa 
gagacttccc 
aacctgggag 
tagcttcttg 
ttatatctgt 
atagtttctg 
attcttttgg 
acatggaccg 



ggacctgatg 
tctgctgcag 
agattctgct 
ggagtgcctg 
tgagaccagt 
ctactctgga 
agaaggacct 
cctcatccga 
ccagaaccgt 
gaacacaccg 
ttaccaagtc 
agctgagttc 
aggggattgc 
tgccccaccc 
accagctggg 
ttgaggctgg 
acctgggtca 
cctcactcat 
caaacttcgc 
ggaagaggca 



# 

aagfcctctac 
cccacagccc 
gtggacgtgg 
gcagtctaca 
ggcatcttgc 
atcattgact 
agcagagagg 
gtgtttgatc 
ttccatggag 
ctgtactttg 
ctgattcgca 
aggaagaaat 
cagtcagccc 
cacatgacat 
cctccaagta 
aactgtttct 
tccagctcct 
ccctcctgta 
tgttgtttgt 
ctggagttgg 
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aggcagcact 
agtttgacgc 
ccattgaaat 
aacacaattt 
aggacctgct 
ataatctggc 
aaacatgggt 
agtaccagcg 
atgctcaggt 
ctgacaaact 
tccttatctc 
ttgggaagtc 
tcctggcctt 
ccgaggatct 
ggataacccc 
ttaaaatccc 
tcttgggtgt 
ccctggccag 
gttccctgat 
aggtgcct 



ttccggcaac 
ccaggaattg 
tcttgccact 
ccaggtggag 
gttggccctg 
agaacaagat 
cccagtcttc 
gagcactggg 
ggctctgctc 
tcatcaagcc 
tcgatgtgag 
cctctactct 
gtgcagggct 
gagatttccg 
tcactgagca 
ttaattttcc 
ggggaaatga 
aacatctcac 
tgaaggttgg 



ctggagagga 
aggacagctc 
cgaaccccac 
gctgtggatg 
gccaaggggg 
gtccaggccc 
acccagcgaa 
caagagctgg 
ggcctagctt 
ctccaggaaa 
actgaccttc 
tctctccagg 
gaagacatgt 
tgtttggctg 
ccacattctc 
catctcaaaa 
gttttctttg 
tgatactcga 
gtggagcagg 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1838 



<210> 65 
<211> 1689 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2658329CB1 



<400> 65 

tcacgtcggc cggagctagc gctgcgtcct gggccacgcc tcccggcgca ccgacgcgcc 60 
tctccggtta ctaagcggcc ttggatacct ggccgcggga tgctgggcgg cgtcaggtaa 120 
ccatggagaa agagctgcgg agcaccattc ttttcaatgc ctacaaaaag gagatattta 180 
ccaccaacaa tggctacaaa tccatgcaga aaaaacttcg gagtaattgg aagattcaga 240 
gcttaaaaga tgaaatcaca tctgagaagt taaatggagt aaaactgtgg attacagctg 300 
ggccaaggga aaaatttact gcagctgagt ttgaaatcct gaagaaatat cttgacactg 3 60 
gtggagatgt ctttgtgatg ctaggagaag gtggagaatc cagatttgac accaatatta 420 
actttttact agaagaatat ggaatcatgg ttaataatga tgctgtggtt agaaatgtat 480 
atcacaaata tttccatcct aaagaagctc tagtttccag tggagtcttg aacagggaaa 540 
ttagccgagc tgcaggaaag gctgtgcctg ggatcattga tgaggaaagc agtggaaaca 600 
atgcccaggc tctcaccttt gtgtatcctt ttggtgccac attgagtgtc atgaaaccag 660 
cagtggcggt tctgtctaca ggttctgtct gcttcccact taacagaccc attttggctt 720 
tctatcactc aaagaaccaa ggtgggaagc tggcagtgct tggttcatgt cacatgttca 780 
gtgatcaata tttggacaaa gaagaaaaca gcaaaatcat ggatgttgtt ttccagtggc 84 0 
tcacgacagg agacatccac ctaaaccaga ttgatgctga ggacccagag atttctgact 900 
acatgatgct gccctacaca gccaccctat caaagcggaa tcgagagtgt ctccaggaga 960 
gtgatgagat cccaagggac tttaccaccc tcttcgacct gtccatcttc cagctggata 1020 
ccacctcctt ccacagcgtc atcgaggctc acgagcagct aaatgtgaaa catgaaccac 1080 
tccagctcat ccagcctcag tttgagacgc cgctgccaac ccttcagcct gcggtttttc 1140 
ctcccagttt ccgggagtta ccacctcctc ctctggagct atttgattta gatgaaacgt 1200 
tctcctctga gaaggcacgg ctggctcaga ttaccaataa gtgtactgaa gaagacctgg 1260 
aattttatgt caggaagtgt ggtgatattc ttggagtaac cagtaaacta ccaaaggacc 1320 
aacaggatgc caaacatatc cttgagcacg tcttcttcca agtggtggag ttcaagaaat 1380 
tgaaccagga acatgacatc gatacaagtg aaacagcatt ccagaacaat ttctgaagac 1440 
catgcctctt gaagcttttt ctgcctcctg attctctctt tgtaaactat tttcaaattg 1500 
tttttcaact ccttatcaaa attgtttata cactctttcc tccatgagct ctggaaggta 1560 
tatgcatctt ctgtaatact cagataggta taagattttt cacaaaatcc ttatgtaaga 1620 
tacattccat ttttaaaaat taaatgtatg gttgcatctg tctttttata ccctaaaaaa 1680 
aaaaaaaaa 1689 



<210> 66 
<211> 1788 
<212> DNA 

<213> Homo sapiens 



<220> 



58/82 



WO 00/77040 



PCT/US00/16G36 



<221> mis cofeature 

<223> Incyte ID No: 2708944CB1 



<400> 66 

cgagctcgcc cgctgtccgc cagcccgcgg gagggaggag agaagcgacg atgtccgcgg 60 
ttggctactc agtgtcttgg tctcaagttg cctcattgcg gctggcgttc ccaatacaga 120 
cgcatcgttt cttttttaat actccctaag aaagggaata accttcaagc tggcgggagc 180 
aatggttcac ataaagaaag gcgagctgac ccaggaggag aaggagctac tggaagtcat 240 
cgggaaaggt actgtccaag aagctggaac attattatcc agcaagaatg ttcgtgtcaa 3 00 
ctgtttggac gagaatggaa tgactcctct aatgcatgca gcatataaag gaaaactgga 360 
tatgtgcaaa ttactcctgc gacatggagc cgatgtaaat tgtcatcagc atgaacatgg 420 
atacacagcc ctcatgtttg ctgcactttc tggtaataaa gacatcacat gggtaatgtt 480 
agaagctggt gctgagacag atgttgtcaa ctctgtggga agaacagcag ctcagatggc 540 
agcctttgtg ggtcaacatg attgtgtgac cataatcaac aatttctttc ctcgagagag 600 
actggattat tacactaagc cccagggact ggataaagag ccaaaactgc ccccaaagtt 660 
ggcaggcccg ctgcacaaaa ttatcaccac aacgaatctt catcctgtca agatcgtgat 72 0 
gcttgtaaat gagaatcctc tgctgacaga agaagcagcc ctgaataaat gctacagagt 780 
gatggatttg atttgtgaga aatgtatgaa gcaaagagac atgaatgaag tattggctat 84 0 
gaagatgcat tacataagct gtatctttca gaaatgcatt aacttcttaa aagatggaga 900 
gaataaactg gacaccttga tcaaaagctt gttaaaaggc cgagcttctg atggctttcc 960 
agtgtatcaa gaaaagatca ttagagaaag tatcagaaaa tttccttact gtgaagctac 1020 
actcctccag cagctggttc gaagcattgc tcctgttgaa attggttctg atcccactgc 1080 
attctccgtc cttacccaag ccatcactgg ccaggtgggt tttgtggatg tggaattttg 1140 
cactacctgt ggagaaaagg gagcaagtaa aagatgttca gtttgcaaaa tggtaatata 1200 
ttgtgatcaa acctgccaga aaacacactg gtttactcat aagaaaatct gtaagaatct 1260 
gaaggacatt tacgaaaagc aacagttgga ggctgccaaa gaaaagagac aagaggaaaa 132 0 
ccacggcaaa cttgatgtca attctaactg tgttaatgaa gagcaaccag aggctgaagt 1380 
aggtatctct caaaaggatt ccaatcctga agattccggg gaaggaaaga aagaatctct 1440 
tgaaagcgaa gctgagttgg aaggcttaca ggatgctcct gcagggccac aggtgtctga 1500 
ggagtaaaag ccagagcaag tgccagtgtg gatggtcctc accctgcaag aagctggaaa 1560 
actcctagga atgcattgtc ctcaccttgt tatacctgcg tggcaccatg gcaggattcc 1620 
acatttcata gaatacaggt tttcaagcaa acccctgttg accatgccct aatttcctat 1680 
tgatttctgt tgtataattg aatggatatt cctatggaaa attttttgtt tcaaaataca 1740 
ggaaaaacat tcctattacc tttctgaggc tggctttcca gcaattgt 1788 

<210> 67 

<211> 2160 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3315012CB1 



<400> 67 

atgctacgcg cgcccggctg cctcctccgg acctccgtag cgcctgccgc ggccctggct 60 
gcggcgctgc tctcgtcgct tgcgcgctgc tctcttctag agccgaggga cccggtggcc 120 
tcgtcgctca gcccctattt cggcaccaag actcgctacg aggatgtcaa ccccgtgcta 180 
ttgtcgggcc ccgaggctcc gtggcgggac cctgagctgc tggaggggac ctgcaccccg 240 
gtgcagctgg tcgccctcat tcgccacggc acccgctacc ccacggtcaa acagatccgc 300 
aagctgaggc agctgcacgg gttgctgcag gcccgcgggt ccagggatgg cggggctagt 3 60 
agtaccggca gccgcgacct gggtgcagcg ctggccgact ggcctttgtg gtacgcggac 420 
tggatggacg ggcagctagt agagaaggga cggcaggata tgcgacagct ggcgctgcgt 480 
ctggcctcgc tcttcccggt tcttttcagc cgtgagaact acggccgcct gcggctcatc 540 
accagttcca agcaccgctg catggatagc agcgccgcct tcctgcaggg gctgtggcag 600 
cactaccacc ctggcttgcc gccgccggac gtcgcagata tggagtttgg acctccaaca 660 
gttaatgata aactaatgag attttttgat cactgtgaga agtttttaac tgaagtagaa 72 0 
aaaaatgcta cagctcttta tcacgtggaa gccttcaaaa ctggaccaga aatgcagaac 780 
attttaaaaa aagttgcagc tactttgcaa gtgccagtaa atgatttaaa tgcagattta 840 
attcaagtag cctttttcac ctgttcattt gacctggcaa ttaaaggtgt taaatctcct 900 
tggtgtgatg tttttgacat agatgatgca aaggtattag aatatttaaa tgatctgaaa 9 60 
caatattgga aaagaggata tgggtatact attaacagtc gatccagctg caccttgttt 1020 
caggatatct ttcagcactt ggacaaagca gttgaacaga aacaaaggtc tcagccaatt 1080 
tcttctccag tcatcctcca gtttggtcat gcagagactc ttcttccact gctttctctc 1140 
atgggctact tcaaagacaa ggaaccccta acagcgtaca attacaaaaa acaaatgcat 1200 
cggaagttcc gaagtggtct cattgtacct tatgcctcga acctgatatt tgtgctttac 1260 
cactgtgaaa atgctaagac tcctaaagaa caattccgag tgcagatgtt attaaatgaa 1320 



59/82 



WO 00/77040 




PCT/US00/16636 



aaggtgttac ctttggctta cWacaagaa actgtttcat tttatgaaga tctgaagaac 13 80 
cactacaagg acatccttca gagttgtcaa accagtgaag aatgtgaatt agcaagggct 1440 
aacagtacat ctgatgaact atgagtaact gaagaacatt tttaattctt taggaatctg 1500 
caatgagtga ttacatgctt gtaataggta ggcaattcct tgattacagg aagcttttat 1560 
attacttgag tatttctgtc ttttcacaga aaaacattgg gtttctctct gggtttggac 1620 
atgaaatgta agaaaagatt tttcactgga gcagctctct taaggagaaa caaatctatt 1680 
tagagaaaca gctggccctg caaatgttta cagaaatgaa attcttccta cttatataag 1740 
aaatctcaca ctgagataga attgtgattt cataataaca cttgaaaagt gctggagtaa 1800 
caaaatatct cagttggacc atccttaact tgattgaact gtctaggaac tttacagatt 1860 
gttctgcagt tctctcttct tttcctcagg taggacagct ctagcatttt cttaatcagg 1920 
aatattgtgg taagctggga gtatcactct ggaagaaagt aacatctcca gatgagaatt 1980 
tgaaacaaga aacagagtgt tgtaaaagga caccttcact gaagcaagtc ggaaagtaca 2040 
atgaaaataa atatttttgg tatttattta tgaaatattt gaacattttt tcaataattc 2100 
ctttttactt ctaggaagtc tcaaaagacc atcttaaatt attatatgtt tgggacaata 2160 

<210> 68 
<211> 1156 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4155412CB1 



ctgagacgag cgggagcgcg gacagcagcc tctgctgccc tgacttttta agaaatctca 60 

atgaactatt tgtagagaat cactgatccg gcctgcaagc attttgcacg gcaaaaatat 120 

cgatcagtgt taagtgaaga tcacatttta tatgcgatct tgactttttt gtcttacatt 18 0 

atatttttat agattttgtt ataaacatgg tgctgggaaa ggtgaagagt ttgacaataa 240 

gctttgactg tcttaatgac agcaatgtcc ctgtgtattc tagtggggat accgtctcag 3 00 

gaagggtaaa tttagaagtt actggggaaa tcagagtaaa atctcttaaa attcatgcaa 3 60 

gaggacatgc gaaagtacgc tggactgaat ctagaaacgc cggctccaat actgcctata 420 

cacagaatta cactgaagaa gtagagtatt tcaaccataa agacatctta attgggcacg 480 

aaagagatga tgataattcc gaagaaggct tccacactat tcattcagga aggcatgaat 540 

atgcattcag cttcgagctt ccacagacac cactcgctac ctcattcgaa ggccgacatg 600 

gcagtgtgcg ctattgggtg aaagccgaat tgcacaggcc ttggctacta ccagtaaaat 660 

taaagaagga atttacagtc tttgagcata tagatatcaa cactccttca ttactgtcac 720 

cccaagcagg cacaaaagaa aagacactct gttgctggtt ctgtacctca ggcccaatat 780 

ccttaagtgc caaaattgaa aggaagggct ataccccagg tgaatcaatt cagatatttg 840 

ctgagattga gaactgctct tcccgaatgg tggtgccaag gcagccattt accaaacaca 900 

ggccttctat tgctaaaggg aaattgaggg agctaaacag cttgtggcta acatgcgtgg 960 
ggaattcctt aacatctgga aagaaccggg acgtggaaat ggccagtttg ctgaaaattt 1020 

ccaacagttt tcccccctcc aatgcttcga actgaaggaa taatcccggc tgggaataat 1080 

ccactcaatg ggtaaaatgg tgggaaaatc ccctggagcc aatggaattt aaattcctct 1140 
aaatttggcc cacttg 1156 

<210> 69 

<211> 1981 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4831840CB1 



ggctggggaa gatggcggtg gctggggcgg tgtccgggga gccgctggtg cactggtgca 60 

cccagcagtt gcggaagact ttcggcctgg atgtcagcga ggagatcatt cagtacgttt 120 

tgtcaattga gagtgctgaa gagatacgag aatatgttac tgatctcctc cagggaaatg 180 

aaggcaaaaa aggtcaattc atagaagaac ttataaccaa atggcaaaag aatgatcagg 240 

agttgatttc ggatcctttg cagcagtgct tcaaaaaaga tgaaatttta gatgggcaga 3 00 

aatcaggcga ccatctaaag cggggtagga agaaagggag aaacagacag gaagttcctg 3 60 

catttactga acctgacacg actgcagagg ttaaaacacc ttttgatttg gccaaggcac 420 

aagagaacag caactccgta aagaagaaga caaagtttgt caatttatac acaagagagg 480 

gacaggacag gcttgcagtc ctgctccctg gtcgtcaccc ttgtgattgc ctgggccaga 540 

agcacaagct catcaataac' tgtctgatct gtgggcgcat tgtctgtgaa caagaaggct 600 

caggcccttg cttattctgt ggcactctgg tgtgtactca tgaggaacaa gatattttac 660 



<400> 68 



<400> 69 



60/82 



WO 00/77040 



agcgtgactc 
ctggaaaggt 
ctggtctgga 
ttcgaaggac 
ggttgtccaa 
gacacgcctc 
aagaagaaaa 
ccaatggaac 
gagttctggt 
cagcctcaca 
agcaccagtt 
tctctgtaca 
gatcctggta 
cccctcaaga 
aatttcctaa 
tgtcccagaa 
ttgttttcat 
caaaaatctg 
agaataaagc 
ctaaaattgc 
tcaggcttga 
gtgggattcc 
a 



aaacaagagc 
ggacatctct 
gaaggctatc 
ccaagtcatt 
acttgagcgg 
tcgactttct 
ttcactagca 
cttgaaccag 
aaatcccaac 
gaagaaggct 
gcgaatccag 
tcagccctgg 
caccccccac 
agtctcagaa 
tgactatccg 
gcaatttaag 
ctgcaaaaat 
gaaattggat 
tgtctgaccc 
tatctactgg 
atatctcaga 
tactttatgt 



^l^aaactgc 

accaaggacc 
aagcataaag 
gatgatgagt 
gaaaccttgc 
aagaaggtca 
gagtatcata 
ccactgacca 
atgtaccagt 
ttccgttctt 
gatcaagaat 
gcttctctgc 
agaggacgac 
ctccaggcta 
tcaggttgtc 
gagcagtttc 
cctcaggaaa 
tccaagatcc 
aggagaaaag 
tcctttggaa 
acttaaactc 
aatggggtcg 



PCT/US00/16636 



taaagaaact catgtcagga U^iggagaatt 720 
ttcttcctca tcaagaattg cgaattaagt 780 
acaaactgtt agagtttgac agaactagta 840 
cagattactt tgccagtgat tctaaccaat 900 
agaagcgaga ggaggagctg agagaacttc 9 60 
ccattgactt tgcaggaagg aagatcctgg 1020 
gcagactaga tgagacaata caggccattg 1080 
aattggatag atcttctgaa gagcctttgg 1140 
cccctcccca gtgggttgac cacacaggtg 1200 
caggatttgg actagagttc aactcatttc 1260 
ttcaggaagg ctttgatggt ggctggtgcc 1320 
ttgtcagagg gattaaaagg gtggagggca 1380 
tttggatagc agccacagct aaaaaaccct 1440 
catatcgtct tcttcgtggg aaagatgtgg 1500 
ttctgggctg tgtggaccta attgactgct 1560 
cagacatcag tcaagaatct gattctccat 1620 
tggttgtgaa gtttcctatt aaaggaaatc 1680 
atcaaggagc aaagaagggg ttaatgaagc 1740 
gaactataca gcatagtgga gttttgtgta 1800 
ttgaagtagt agaaacctaa aggcttggcg 1860 
ttaccaaaat ctgtatattt ttcttaagga 1920 
aaatctttga acacattatt tataaaaacc 1980 

1981 



<210> 70 
<211> 1832 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5676581CB1 



<400> 70 

cctagagttg 

gcaacaggcg 

gtttccgaaa 

agtacattgg 

cctgtaggag 

acatctaccc 

caatatgata 

gtctgggatg 

tgggagttac 

tctgtgccag 

acgatttctg 

gaagtaaatg 

gcatcgctgc 

agaaaagtaa 

gaagaagcaa 

tctcaggagg 

gatacccttg 

gaaggagatg 

caagatggaa 

accagtccag 

ggagaaactg 

atggaagtag 

gaacaggatt 

tttctaggca 

actcgaattg 

caggatgcag 

tgccactggg 

acatgatttg 

ctgcagttag 

attggagaaa 

ggacaaagct 



ttcagaatcc 
gtctatgctt 
aactagcaca 
ccaacttcct 
cttctgctcc 
tttcttcttc 
ctcagtgttc 
agaacacggg 
cccaatatct 
gtgctgaaac 
tttccagtag 
aaggaattca 
ttgctccttt 
tttgtaaaga 
caacaatagt 
atctttgcag 
aactggagct 
gtagtgtgtc 
tgcgtaggct 
aatctaccag 
caattggtgc 
aagaatctcc 
tgaaatttca 
ttaatagaca 
cagactggcg 
cagaacaact 
acaggtacgc 
tgatcaccac 
cggtgctggg 
caaaagcaat 
ggtttttact 



tccaaaacca 
gcttggtgct 
atccaaagag 
agcggagatc 
acctccaact 
tacttcaaat 
actggcagga 
atgttattat 
tgccacacag 
tagttttgtg 
taaaagtgga 
ggctctctca 
attgcctgag 
ggagccagtt 
aaagccacag 
tgttgtccaa 
agttttggaa 
agggtctagt 
tatgtctaaa 
taggagttct 
tgaaaattca 
agagaaaata 
gattggagaa 
atccatctcc 
ggaaggggct 
aaaacagtat 
actcttctcc 
catctgacga 
caggcaaagc 
gtttaaagtg 
ctccagagtg 



gtcatgacca 
tatgctgaca 
acaaatggaa 
gatgccataa 
ccacctcgac 
ggaacagact 
gtcggaattg 
tattggaata 
gtacagggat 
gtaaatacag 
ccagtcatag 
aatagtgagg 
ggaataaaag 
tcagaagtaa 
gaaattatgt 
tctggagaaa 
aggaaaaaag 
ccacgttctg 
agaggaaaat 
agtaaaactg 
gaaaaaatag 
aaagtacaga 
ctggcaaata 
aactttcatg 
cttaatggaa 
gaaataaacg 
ccttttcacc 
catgacagcc 
accgcaagca 
ttccatgtaa 
tt 



ctagacccac 
gtgatgacga 
accagtcaac 
cagctcctca 
cagagccaaa 
ccacccaaac 
agatgggcga 
cacaaacaaa 
tacagcatta 
acatatattc 
ccaagcgaga 
aggagaagaa 
aagaagaaga 
aagaaacaag 
tggacaatat 
gtgaggagga 
cagagttgcg 
atatcagcca 
ggaagatgtt 
gacgagatac 
atgagaattc 
caacaccaaa 
ccctgacaag 
tgctgctctt 
actaccttaa 
ccactcctaa 
tttcaccttt 
tgtctggaga 
caaagtttgg 
cggatttttt 



agctgttaaa 
tgacaatgat 
tgatattgat 
gcctgcagct 
ggaagcagca 
atctggttgg 
ttggcaggaa 
tgaagtgact 
ccagcccagt 
taaggagaaa 
agttaaaaag 
aggggtggca 
gagatggaga 
tacaacagta 
agaagaccct 
agaggaacaa 
agccttggag 
gccagcatct 
tgttcgagct 
tccagaaaat 
agataaagag 
agtagaagaa 
taaattcgag 
acagactgag 
acgaaaactt 
aggctggtcc 
gacatctcag 
ctgagagcag 
ccagccacac 
cccaagatat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1832 



<210> 71 



61/82 



WO 00/77040 



PCT/USOO/16636 



<211> 1772 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 034159CB1 



<400> 71 

acagtactga tattaagcag catccaacac aggcctactc ttacgacatg tgactttact 60 
gttttccgtt tttgttgaaa gagtcattaa cagttaggag ttgatggcag tttcaataac 120 
aggtcattgc cgagaaaagg atagcactat aatatgcaga aatctacaaa ttctgatact 180 
tccgtggaaa cactgaattc tacccgccaa ggcacaggag ctgtgcaaat gagaatcaaa 240 
aatgccaaca gccaccatga caggctcagc caaagtaaat ccatgatcct caccgatgtc 300 
gggaaggtca ctgaacctat atccagacac agaaggaatc attcacagca tatcttgaaa 3 60 
gatgtcattc ctccattgga acaactgatg gttgaaaaag aaggttatct gcaaaaagct 420 
aaaattgcag atggaggaaa gaaactaagg aaaaactggt ctacttcctg gattgttctt 480 
tctagtcgaa gaattgaatt ttacaaagaa tccaagcaac aggctctgtc caatatgaaa 540 
actgggcaca aaccagaaag tgtggatttg tgtggagcac acattgaatg ggccaaggaa 600 
aaatcgagca gaaagaatgt ctttcagatc acaacagtat caggaaatga gttccttcta 660 
cagtcagata ttgacttcat catattggat tggttccacg ctatcaaaaa tgcaattgac 720 
agattgccaa aggattcaag ttgtccatca agaaacctgg aattattcaa aatccaaaga 780 
tcctctagca ctgaattgct aagtcactat gacagtgata taaaagaaca gaaaccagag 840 
cacagaaaat ctttaatgtt cagactgcat cacagtgctt ccgatacaag cgacaaaaat 900 
cgagttaaaa gcagattaaa gaagtttatt acccgaagac cttccctgaa aactctgcaa 960 
gaaaaaggac ttattaaaga tcaaattttt ggctctcatc tgcacaaagt gtgtgaacgt 1020 
gaaaattcca cagttccgtg gtttgtaaag caatgcattg aagctgttga gaaaagaggt 1080 
ctagatgttg atggaatata tcgagttagt ggcaatctgg caacaataca gaagttaaga 1140 
tttattgtca accaagaaga gaagctgaat ttggacgaca gccagtggga ggacatccac 1200 
gttgtcaccg gagcactgaa gatgtttttc cgggagctgc ctgagccgct cttcccttac 1260 
agtttctttg agcagtttgt ggaagcgatc aaaaagcaag acaacaacac aagaattgaa 1320 
gctgtaaaat ctcttgtaca aaaactccct ccgccaaatc gtgacaccat gaaagtcctc 1380 
tttggacatc taactaagat agtggccaaa gcctccaaga acctcatgtc cacgcaaagc 1440 
ttggggattg tatttggacc tacccttctg cgagctgaaa atgaaacagg aaacatggcg 1500 
atccacatgg tctaccagaa ccagatagct gagctcatgc tgagtgagta cagtaagatc 1560 
ttcggctcag aggaagactg acagacaaga caagctactg aatacgttca catctgtctt 1620 
gatgcctaat atttttacat ttctgtaaac atatttctga aatatttttt gcctttcaag 1680 
cgacagatgc ctcattttgt gaaaacttaa tgatgatttt gtgtttaagt tccaaacatt 1740 
tgaataaaat aattgacaat aaaaaaaaaa aa 1772 

<210> 72 

<211> 1488 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 129023CB1 



<400> 72 

cgggacacaa gatggcggca gcggcgctgg ggagggcgag gcggaggcgg caaaacgggc 60 
ggtcgagcag aacgtgtagc cgcgtcccct ccagtccgct ccgggcagct gctgatgcaa 120 
ggaatcccct gggctcccgt ccactccact gctgaccagc ccattcgcct gtgctgagtc 180 
ttcctgcagg cctttccttg cctctgtggg accctgtggg ggtccatccg gctggagaag 240 
aaaagcctct catgctaacg ttgcagaccc cagagggtcc tgtgtgggtg tggagatggc 3 00 
caatgagaat cacggcagcc cccgggagga agcgtccctg ctgagtcact ccccaggtac 3 60 
ctccaatcag agccagccct gttctccaaa gccaatccgc ctggttcagg acctcccaga 42 0 
ggagctggtg catgcaggct gggagaagtg ctggagccgg agggagaatc gtccctacta 480 
cttcaaccga ttcaccaacc agtccctgtg ggagatgccc gtgctggggc agcacgatgt 540 
gatttcggac cctttggggc tgaatgcgac cccactgccc caagactcaa gcttggtgga 600 
aactcccccg gctgagaaca agcccagaaa gcggcagctc tcggaagagc agccaagcgg 660 
caatggtgtg aagaagccca agattgaaat cccagtgaca cccacaggcc agtcggtgcc 720 
cagctccccc agtatcccag gaaccccaac gctgaagatg tggggtacgt cccctgaaga 780 
taaacagcag gcagctctcc tacgacccac tgaggtctac tgggacctgg acatccagac 840 
caatgctgtc atcaagcacc gggggccttc agaggtgctg cccccgcatc ccgaagtgga 900 
actgctccgc tctcagctca tcctgaagct tcggcagcac tatcgggagc tgtgccagca 960 
gcgagagggc attgagcctc cacgggagtc tttcaaccgc tggatgctgg agcgcaaggt 1020 



62/82 



WO 00/77040 



PCT/US00/16636 



ccctgttgcc cagcaactgt gaaccagtcg tgtcaccttc 1080 

acgacattcc tatcaggtta tcccgaatca agttccggga 1140 

ttaaatatgc ggaggccgcc aggcggctca tcgagtccag 1200 

ggaaggtggt caaatggaat gtggaagaca cccttagctg 1260 

gcctccaaag aggggcaaca tggatcgcct ggaacatctg 1320 

agtctcggcc gcaaccaaga ctccgtgcga acgcatctgc 1380 

gaggtagtaa acgatccgga gaagacctgc atcctcagaa 1440 

ctagtgagcc gctatgttgt accatcgc 1488 

<210> 73 

<211> 2430 

<212> DNA 

<213> Homo sapiens 



ggtagacaaa 
catgtttcgt 
ggaagccaag 
gagtgcatcc 
gcttcgcgaa 
cggaagcagt 
agtagatcta 
acactctcca 



ggatctgacc 
gaaatcatga 
cgcctgctct 
cctgacagta 
ggaccactca 
gctggcccca 
caaatctcct 
gagtgaggcc 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1358940CB1 



<400> 73 

ggcccagcgg 

tgtttcatct 

tgtttggacg 

tctcgatggt 

gctttcctgg 

gcagaatctc 

aaagatgaag 

gaagagaaaa 

aggattcgag 

atcagtgagc 

acacccaatg 

gatggaaaga 

acccttggag 

tccactcctc 

gaagtagata 

ggtatgcgcc 

gccctcacaa 

ctcgaatttc 

cctgtggatg 

gtgcagtcca 

aagggtctga 

gtggacactg 

cagcaggtgg 

tttgcactct 

atgaagcaac 

atgcaggcca 

taaccccaca 

gtctcggcag 

acctcaggac 

tctgattctg 

ctcagacatt 

cagggaacca 

ctatggattt 

acaagtaggg 

atgacctagg 

tccctgggat 

agattcagaa 

cctccctaag 

gttccctgca 

agcctgtttg 

ataaacatcc 



ctaggagagt 
ctatggctgt 
cgatgtttcg 
accatggagg 
gagcatctgc 
caccatgtgt 
gggatgtttg 
agaagaaacg 
cctactccac 
ctggtgaagc 
aaaaacaacc 
aaatttccca 
aatgtgggct 
agagaaattt 
tggaagaatt 
acagagatcg 
cctacttttt 
agcgtaaact 
ggagaattac 
agaagctgac 
catttcagga 
cattgagttt 
ccaggacagt 
ttgactgtga 
ggctgatgag 
tgtggaaatg 
ctgcaagagg 
agccctttgt 
tctgtcggtt 
cccaatgagt 
tgtcagcctc 
tccacacacc 
atactaacaa 
aagatatctg 
caccctacgc 
ggctccaagg 
tttaaagtag 
atgactccag 
agtgccattt 
ccctctgtct 
ccaaagttaa 



cacgtgagat 
cagaggtggg 
tctgaactca 
atcacagccc 
agtaactgca 
agacaaccta 
taaccatgag 
ttctggattc 
gccagacaaa 
agaagtgttt 
agaacacttg 
ggaacgagaa 
catatccttt 
tgaaattgcc 
tgaacaggtt 
tccaactact 
tggagctgat 
gcagcatgat 
tgagaggcag 
cgccatgcag 
ggtggagaac 
ttaccatatg 
ggctaaagtg 
tggcaatggc 
aggcctggaa 
tgcacaggaa 
ggacccctcc 
gctgctgctt 
tcccctcttt 
atccccatag 
aacagcacca 
tgccaaccct 
gaacattcct 
gcaggcagaa 
tagtgtgaga 
cctgctctag 
ttgcttcatg 
tgtgaccctg 
cctctccagg 
taaacaattg 
aaaaaaaaaa 



tgggcggagg 
cggctttgac 
ctttctgctt 
atccagatcc 
agtactggtc 
aaaagtgaca 
aaaaagactg 
agagacagaa 
atcttccgat 
atgacaccag 
ggtctggatc 
aaatttgctg 
tcagactaca 
ttcaagatgt 
cagagcatca 
ggcaacaccc 
ctgaagggaa 
gttctgaagc 
tttggtggca 
aggcagctca 
ttctttactt 
gctggagcat 
gagctctcag 
gaactgagca 
aagcccaaag 
actgcctggg 
acccccagta 
ctggaagtag 
acccttcccc 
gttctcaaaa 
cccattcaag 
gggaagcatc 
tgacttccct 
agaagtctgt 
agcctgcgcc 
gaaggcagca 
gctctgtgca 
tgcttagtga 
atgggcctct 
taaatatcac 



gggtggaggt 
cgagaggctg 
tggcagaact 
ggcgaagact 
ttttgtggaa 
tcggtgataa 
cagatcttgc 
aagtgatgga 
attttgccac 
aagattttgt 
aatatataat 
atgaaggcag 
ttttcctcac 
ttgatttgaa 
ttcgctccca 
tcaagtctgg 
agctgacaat 
ttgagtttga 
tgctacttgc 
agaagcactt 
tcctaaagaa 
ctcttgataa 
accacgtgtg 
ataaggaatt 
acatgggttt 
acttcgcttt 
ccctggaccc 
tctcccttcc 
gtccccgtgt 
acatgaacaa 
catcctgtgg 
cagttctcaa 
cctgctggtg 
gatgataaac 
ccaggaagga 
tgctcagtgg 
ctcccttttc 
gcaatagtga 
aaagctgagg 
ttaaattata 



ttgtctccgc 
ctggagctcg 
ggctgtgggt 
aatgatggtg 
gagggcccat 
agggaagaat 
ccctcaccca 
atatgagaat 
cttgaaagtc 
gcgatccata 
aaaacgcttt 
tatattttac 
aactgttctt 
tggagatgga 
aaccagtatg 
cttgtgttca 
caaaaacttc 
acgccatgac 
ctacagtggg 
caaagaagga 
cattaatgat 
agtgaccatg 
tgatgtggtg 
tgtttccatc 
cactcgcctc 
acccaaacag 
cctccgcaga 
tcccgggatg 
tctgctgggc 
gtctgtaaag 
ataaagaatt 
atcgtttttg 
ttttaaagcc 
aatgatgagg 
tctgtgttag 
gaacacagca 
ttcctcgcag 
ttgagctcat 
cc tggctcag 
accatttgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2430 



<210> 74 
<211> 1411 
<212> DNA 

<213> Homo sapiens 



<220> 



63/82 



WO 00/77040 



PCT/US00/16636 



<221> mis cofeature 
<223> Incyte ID No: 



1682320CB1 



<400> 74 

agcactagtc 

aggcggaagc 

cttgagttct 

gttgggcttc 

agactgcaga 

gggggactct 

ttattttgct 
cgtgggcgac 
accagcgcta 
ttcagctgcg 
gcccggccct 
acaatctagc 
atagtcagaa 
tggccttttt 
ataggcagcc 
gcagctacca 
agatgctcct 
cctcagcaag 
gcacaaactt 
tatacctttc 
catacatcag 
tgttagatgt 
cagaaagaat 
gggcgctcta 



gcgcccactc 
cagtgtccca 
tcggaaagtc 
attattccca 
aaggagaggt 
ccaggaagaa 
cccaacccca 
agttatggag 
ccgggaaggg 
gggtctggat 
cacgcctgaa 
tgcttgtctc 
agtcctggaa 
ccatctgcag 
taaagatgcc 
tagaaaagag 
ccagttgaac 
agaaattaac 
cactacttag 
aatacatgtt 
ttgtgggtgg 
acttatatac 
tgtcttcctt 
gctcatgcct 



ccttctactt 
ggcgttctca 
tcatccaccc 
catccctttc 
ggggctttca 
gggtaatttc 
gagcttcact 
aagcgtctgc 
aagtaccgag 
ccgagtctgc 
caagaaaaca 
cttcagatgg 
cgacagcctg 
gactatgacc 
aacgtccggc 
aagcagctct 
ttaggtggac 
cctatacctc 
acagtctgag 
attgttgcag 
aattaatcat 
agaggcgaca 
tattttttca 
gtaatcccag 



ccaggtcggg 
cgcccgcaac 
ccacatcgcc 
cttactactt 
gtagaaacaa 
ctgcctcctt 
tgctccttca 
aggaggctca 
atgctgtgag 
cctctccgtt 
tattgcatac 
agcccgtgaa 
ataatgccaa 
aggcccgcca 
ggtacctcca 
acctgggcat 
cattaaacat 
tgacccaggt 
tctttttctg 
atatttggct 
atctctggta 
gcttgcagag 
gaagcccaaa 
c 



ggggggcggg 
aattcctgag 
tctttaggaa 
gcctgcactt 
gcaaaccgca 
aaattggctg 
cttcccagtt 
gctgtacaag 
taggtaccat 
acctaatctc 
cacccagaca 
ctacgaacga 
ggccttgtat 
ctacctcctg 
gctgacacag 
gtttggttaa 
gcatgaagga 
ggatttttgt 
tctatccatc 
tgagaaatat 
tagatttttc 
gacgacatag 
ccaaagtggg 



tccaatagaa 
tagggccttg 
gtcacttaat 
cttgagaaaa 
ggtccctgtg 
ctactgtcag 
ccgcaagaac 
gaggaaggga 
cgagctctgc 
ggacctcagg 
gactgctata 
gtgagagaat 
cgggccggag 
gctgccgtga 
tcagaactca 
caaagaagaa 
gaaatctgag 
ttctagttct 
tgtttatttc 
aatcagaaaa 
atgacagtgt 
taaggatagg 
aaatatggcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1411 



<210> 75 

<211> 653 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 1728263CB1 



<400> 75 

gcggtggtag 

aggccagagc 

aagtatttaa 

ctgctgttgt 

cttcaataaa 

aaaagaagga 

cctactatcg 

ccaaattacc 

gtcacgtcag 

tattgtgaac 

tactctacta 



ctggttgggt 
caggtcgcgg 
agcatgtgat 
aatgctgttt 
tccaaatact 
agctcaacga 
tggattttta 
ggaaaggac t 
ctttagagac 
tacttttggt 
atgatgtcat 



tgctggggct 
acgtgggaag 
gaagatcaca 
gggtacaagc 
tctggtatat 
tatcggaacg 
actttggaag 
gttcttgagg 
tttgaatatg 
aactctgggg 
gctacagact 



cggttgttgt 
ccagtccctc 
aaggatatct 
cc tccaagat 
tactcgaggg 
aagtaagaca 
atttcaaaaa 
tattcaggga 
ccctgaacta 
agatcaatag 
tgtgattaaa 



<210> 76 
<211> 1448 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



1867626CB1 



<400> 76 

aacccggat t 

tgtctcctcc 

tggtatttac 

tgaaatacct 

acggagaaat 

ttcattgggc 



ctgcatttcc 
gcaaagcatt 
aagagagaaa 
ccggacaagg 
tcctttggaa 
aaaatggggg 



cctaactctg 
ttaactcgaa 
tcattttacc 
atctgggggt 
gttattccgt 
aacaacctat 



agtcgcgatg 
ggaacacagg 
cagcagagag 
agaagtggat 
gtttttaaat 
catcttcaca 
agcatttagg 
agtagatcga 
tggacagaag 
attgtaatgt 
catttaaaaa 



ttcttctccg 60 
aagtgggtgg 120 
gactttaaaa 180 
tctgtgatgt 240 
attgtcagga 3 00 
gcctttgaca 3 60 
caggtggctc 420 
gattcagatg 480 
gaagcctaac 540 
cagcagactc 600 
ttt 653 



agactcattt 
taagcaaatg 
taagagaact 
gggggtggaa 
agcataagag 
cttcagcact 



tgtggaatag 
ccgcctctgt 
aattgaattg 
aagcaactgc 
ctgaaacttc 
cgagctcatg 



agttgatcgc 60 
ttgaacgttt 12 0 
gcagcatcct 180 
gaaatagcag 240 
agagcaagtt 300 
tcttccaaat 3 60 



64/82 



WO 00/77040 



tgacccaaac 
tttctatgac 
aataaatagt 
gtgggctgat 
ttcgaaattt 
atcacaagag 
tcagtctcct 
gacacagaac 
ttcagcaatc 
actcactgca 
tgcctatcaa 
tagccaagca 
ggaagagaca 
tgccagagaa 
tcggaacaaa 
gaatgaacaa 
gatatttgaa 
aaagtgaaat 
tagctctg 



acaaagaaga 
agcacaagaa 
accatcaccc 
agccgggcaa 
gcagaaaagt 
aagatggaac 
ttaacaccgg 
tcagagccaa 
agcaaacatt 
gccctgctgg 
gaggaagcag 
aatgcagtac 
ctgaaactga 
ctacaagaac 
gacctggagg 
gaagcttttc 
ctaacagaat 
ttcagtgcca 



PCTAJS00/16636 



;gggtacc 
atgtgtatag 
caaacatgac 
acaccgttta 
ttcaggaatt 
ttaccagtac 
aaagtatcaa 
gggctgaacc 
gggaggctga 
agtccactgc 
aacgtctgca 
atactcataa 
aggaagagga 
agagggattc 
gacaactgtc 
gcaataacct 
tacgagataa 
attaattaaa 



caccagcaag 
gataatcagt 
atttactaaa 
tggattggga 
taaagaagct 
accttcacag 
cgggacagat 
aactcagaat 
actggctacc 
caatgtgaaa 
caagcgggtg 
gacagaatta 
aatagaaagg 
tttgactcag 
tgacttagag 
gaagacactc 
cttggccaag 
agatacactg 



catgcagtta 
ttagatggct 
acatctcaga 
ttctcctctg 
gctcgactag 
gaatccgcag 
gacgaaagaa 
gcattgccat 
ctcaaaggaa 
caatggaaac 
actgaacttg 
aatcagacaa 
t taaaacaag 
aaactacagg 
caacgtctgg 
ttagaaattc 
ctactagaat 
tctctcttca 



ctgtgtctta 
caaaggcaat 
ggtttggcca 
agcatcatct 
caaaggaaaa 
gcggggatct 
cacctgatgt 
tttcacatag 
ataatgccaa 
agcaacttgc 
aatgtgttag 
tacaagaact 
aaattgataa 
aagtagaaat 
agaaaagtca 
tggatggaaa 
gcagctaagg 
taggaatgtt 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1448 



<210> 77 

<211> 1538 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1990126CB1 



<400> 77 

gcgccccagc 

ggctgccgcc 

gcaccttcct 

cagaattgga 

cagaaaaaat 

aagaatttgt 

ttttgggaca 

atgcccttat 

ttcaaacgtc 

aaacaattgc 

aaacgagact 

gaataactca 

tcagcagtac 

cttactatgc 

catccaatta 

caaatgcgat 

ccaacctcag 

atgatgcagc 

gtgttgttgg 

tgccaattac 

catgactttg 

taatgtgtta 

tgcatggtaa 

ttacaaaatt 

acttaaagca 

taaaataaat 



ccggccgcgg 
taggatgaat 
cagtcgcgcc 
tgctcactta 
aatgaaacaa 
ttatgagaaa 
atatatgatt 
taaatgtgga 
agccttaaat 
taaagaaagg 
aaaaaaggca 
aagtgaattt 
acatgcccat 
acagtgttac 
tcttagtaac 
tggttcttct 
tgaccttaag 
aaacagtact 
aatggattca 
ctacttagaa 
tatttatata 
agagactgaa 
atatttcatt 
ctctttttat 
gaactgtttt 
ctatltgtgaa 



cacctccgcc 
atcatggact 
gtgcagttca 
gagaacctcc 
actgaagtgt 
ctggatagaa 
gatgcaggga 
gaaacccaaa 
tttcttactc 
aaactattgc 
aaagctgcag 
gatcgtcaag 
caccttcgct 
cagtatatgt 
aacaatcaga 
gccatggctt 
gagtgtagtg 
gaattatcac 
gactggctaa 
ctgctcaatt 
caattaactc 
aataccagcc 
acagaattta 
tgaggtttca 
ctactggatt 
ttgaaaaaaa 



tcgccgccgc 
tcaacgtgaa 
cagaagaaaa 
ttagcaaagc 
tattgcagcc 
aagctccaag 
ctgagtttgg 
aaagaattgg 
ctttaagaaa 
aaaataagag 
aaactagaaa 
cagagattac 
gtctgaatga 
tggacctcca 
cttctgtgac 
caacaagtgg 
gcagcagaaa 
ttctggcaga 
tgggggaaag 
aagtaggtgg 
taaataaagc 
atcagaaact 
tgttagagct 
ctaataagca 
tttcattaac 
aaaaaaaa 



taggtcggcc 
gaagctggcg 
gcttggccag 
tgaatgtacc 
aaatccaaat 
tcgtataaac 
cccaggaaca 
aacagcagac 
ctttatagaa 
actggatttg 
ttcatctgaa 
cagacttctg 
ctttgtagaa 
gaaacaactg 
acctgtacca 
cctagtaatc 
ggccagggtt 
tgaggtgatc 
gggaaaccag 
actatggaaa 
aggttaagta 
ggcctttctg 
ttcatgccaa 
gcttctactt 
agcaagcttt 



ggctccgccc 
gccgacgcag 
gctgagaaga 
aaaatatgga 
gccaggatag 
aacccagaac 
gcttatggta 
agagaactga 
ggagattaca 
gatgctgcaa 
caggaattaa 
ctagagggaa 
gcccagatga 
ggaagttttc 
tcagttttac 
acctctcctt 
ctctatgatt 
actgtgttca 
aagggcaagg 
ggttgcccat 
tcttccatgt 
ccaataaagt 
gaatgttttc 
ttgagcctca 
ttcttttatg 



<210> 78 

<211> 998 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1538 



2104180CB1 



<400> 78 



65/82 



WO 00/77040 



PCT/US00/16636 



ggggcggtgc cgggggcggg gcgcggcggc tgtcagctga ctgtggcggc ggcggcctcg 60 

aggtgacaac tgtctccgtc gcaggctccg gcgggggcgc aggaggtcgc ccggcgcgtc 120 

actgtcgggt cggcgagcca cgggggccgc cgcagcacca tggcgaccac cgtcagcact 180 

cagcgcgggc cggtgtacat cggtgagctc ccgcaggact tcctccgcat cacgcccaca 240 

cagcagcagc ggcaggtcca gctggacgcc caggcggccc agcagctgca gtacggaggc 3 00 

gcagtgggca ccgtgggccg actgaacatc acggtggtac aggcaaagtt ggccaagaat 3 60 

tacggcatga cccgcatgga cccctactgc cgactgcgcc tgggctacgc ggtgtacgag 420 

acgcccacgg cacacaatgg cgccaagaat ccccgctgga ataaggtcat ccactgcacg 480 

gtgcccccag gcgtggactc tttctatctc gagatcttcg atgagagagc cttctccatg 540 

gacgaccgca ttgcctggac ccacatcacc atcccggagt ccctgaggca gggcaaggtg 600 

gaggacaagt ggtacagcct gagcgggagg cagggggacg acaaggaggg catgatcaac 660 

ctcgtcatgt cctacgcgct gcttccagct gccatggtga tgccacccca gcccgtggtc 720 

ctgatgccaa cagtgtacca gcagggcgtt ggctatgtgc ccatcacagg gatgcccgct 780 

gtctgtagcc ccggcatggt gcccgtggcc ctgcccccgg ccgccgtgaa cgcccagccc 840 

cgctgtagcg aggaggacct gaaagccatc caggacatgt tccccaacat ggaccaggag 9 00 

gtgatccgct ccgtgctgga agcccagcga gggaacaagg atgccgccat caactccctg 9 60 

ctgcagatgg gggaggagcc atagagcctc tgcctcga 9 98 

<210> 79 

<211> 1086 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 2122241CB1 



<400> 79 

gacttgctga 

ccaaaatttc 

ccatttggga 

atatctgacc 

cagattatac 

tttattggct 

cacctgacag 

ctggagcggt 

aaaggaagtc 

gaaggtgtgg 

gaagtcggcg 

gggatgtgga 

cacggaaaca 

aggagatcca 

gggagcagaa 

aagatctgga 

tgcgggaaca 

tttctcaaat 

aaaaaa 



ggaaaggcat 
cccttcccct 
ttgccgccca 
tttctctcga 
ctggggcaac 
tctgcacgtc 
aagggactat 
gtcgggttag 
tgagagacga 
cccagggcca 
tgaggtgggg 
gctggagttg 
tgacagcggt 
gagttccaat 
agccaaacag 
gctaataatg 
catgggcaac 
atacctactg 



acgttaatgt 
ttcttccttt 
ctgacaactt 
cctaaacacc 
gaaccagaac 
ttcctcccca 
tcctagttta 
gtc tgagaga 
accgccttcc 
gccccgcctc 
cttatgcggc 
gagactgaga 
gcggcggact 
ctggagacgg 
gagcgggaga 
act gaga tgg 
gtggtagagg 
gattaattta 



atgtgtgagg 
tactcaggtt 
agtagggctt 
tggggctgga 
gacattagag 
gcttcggcca 
tgaggtggtt 
aggcctcgca 
tccctgaagc 
ctccccgggc 
ggcgtggtga 
ccagtggacc 
tggagcgggt 
ccatgtctgt 
aagaactggc 
agatatctcg 
cgcttattgc 
tggcaataaa 



ttaacccagg 
ctgtttcaag 
tcattataag 
aacatggcgg 
gagaaaagag 
ctctcccctt 
aaggatatcg 
caaaacactg 
ttctagaact 
ggaagctgtg 
aatagatatg 
agagcggcct 
caccgactat 
gattggagac 
aaaagtcact 
agcagcagca 
cctaaccaac 
attttttttt 



aggttgaatt 
ctgattccag 
caggcttgag 
gagagattta 
gcctgtcgtt 
cgcaccctct 
gtggggtggg 
tacaaacccg 
ggagcagaaa 
tcagttgccg 
gcgaccgagg 
ccggagaagc 
gcagaggaga 
agaaggtccc 
atcaagaagg 
gaacgcagtt 
tgatgcgtgc 
gtcttttaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1086 



<210> 80 

<211> 2323 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2580428CB1 



<400> 80 

tacgccaccg 

tcagtgagcc 

ccctgctgtc 

acgccaggtg 

gatgacagcc 

ctggaactcc 

cgcggtggct 

gcattactgc 

gagaggaaga 



cgtccgagcc 
tgcgcaggag 
agtgcgcctg 
tagccatgca 
cgagaacccc 
atcccgacta 
ttgaagagcc 
cttgtcttga 
agctgcttag 



caaggcaaga 
gccaataggc 
cgcgcgggtc 
gcgagccgat 
ctcaaacacc 
caagacatgg 
ggtgctgctg 
tgagtctcgt 
ttatatccag 



gcgctacgcg 
tgccaatact 
cggcgccgag 
tccgagcagc 
ccttccgcag 
ggtccggagc 
aagaacatcc 
tttgaaaatc 
cgattggttc 



ccctgcccga 
ccttggactc 
gttcttgact 
cctccaagcg 
aggcagactg 
aggtgtgctc 
gagaaaatga 
ttggagtaag 
aaatccacgt 



agtgcaactg 60 
cccgccaggt 12 0 
gctgtgccgg 180 
tccccgttgc 240 
gtccccgggc 3 00 
cttcctcagg 3 60 
aatcacaggc 420 
ttccttgggg 480 
tgatacaatg 540 



66/82 



WO 00/77040 PCT/US00/16636 

aaggtaatta atgatcctat ^Ktggccac attgagctcc accctctcct cgtccgaatc 600 
attgatacac ctcaatttca acgtcttcga tacatcaaac agctgggagg tggttactat 660 
gtttttccag gagcttcaca caatcgattt gagcatagtc taggggtggg gtatctagca 720 
ggatgtctag ttcacgcact gggtgaaaaa caaccagagc tgcagataag tgaacgagat 780 
gttctctgtg ttcagattgc tggactttgt catgatctcg gtcatgggcc attttctcac 840 
atgtttgatg gacgatttat tccacttgct cgcccggagg tgaaatggac gcatgaacaa 900 
ggctcagtta tgatgtttga gcaccttatt aattctaatg gaattaagcc tgtcatggaa 960 
caatatggtc tcatccctga agaagatatt tgctttataa aggaacaaat tgtaggacca 1020 
cttgaatcac ctgtcgaaga ttcattgtgg ccatataaag ggcgtcctga aaacaaaagc 1080 
ttcctttatg agatagtatc taataaaaga aatggcattg atgtggacaa atgggattat 1140 
tttgccaggg actgccatca tcttggaatc caaaataatt ttgattacaa gcgctttatt 1200 
aagtttgccc gtgtctgtga agtagacaat gagttgcgta tttgtgctag agataaggaa 1260 
gttggaaatc tgtatgacat gttccacact cgcaactctt tacaccgtag agcttatcaa 1320 
cacaaagttg gcaacattat tgatacaatg attacagatg ctttcctcaa agcagatgac 1380 
tacatagaga ttacaggtgc tggaggaaaa aagtatcgca tttctacagc aattgacgac 1440 
atggaagcct atactaagct gacagataac atttttctgg agattttata ctctactgat 1500 
cccaaattga aagacgcacg agagatttta aaacaaattg aataccgtaa tctattcaag 1560 
tatgtgggtg agacgcagcc aacaggacaa ataaagatta aaagggagga ctatgaatct 1620 
cttccaaaag aggttgccag tgctaaaccc aaagtattgc tagacgtgaa actgaaggct 1680 
gaagatttta tagtggatgt tatcaacatg gattatggaa tgcaagaaaa gaatccaatt 1740 
gatcatgtta gcttctattg taagactgcc cccaacagag caatcaggat tactaaaaac 1800 
caggtttcac aacttctgcc agagaaattt gcagagcagc tgattcgagt atattgtaag 1860 
aaggtggaca gaaagagttt gtatgccgca agacaatatt ttgttcagtg gtgtgcagac 1920 
agaaatttca ccaagccgca ggatggcgat gttatagccc cactcataac acctcaaaaa 1980 
aaggaatgga acgacagtac ttcagtccaa aatccaactc gcctccgaga agcatccaaa 2040 
agcagagtcc agctttttaa agatgaccca atgtgaatgt ctgtagtcag ttgtttacaa 2100 
actccctctc ctgcacaatt catttagagg cttcaatcat agaattctgc aaattaatga 2160 
caactcatgc tttaattttg tattttgaat gtacacgcat gctgaagcta agtaactttt 2220 
aatcaaagaa ataagatggt attaggcaaa tcttactata ctatgaaaag cattaccttg 2280 
cctattttta atattattaa agcctttctc cttcaaaaaa aaa 2323 



<210> 81 

<211> 669 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3397189CB1 



<400> 81 

cccacgcgtc cgaacgccat ggctcccaag aagctgtcct gccttcgttc cctgctgctg 60 

ccgctcagcc tgacgctact gctgccccag gcagacactc ggtcgttcgt agtggatagg 12 0 

ggtcatgacc ggtttctcct agacggggcc ccgttccgct atgtgtctgg cagcctgcac 180 

tactttcggg taccgcgggt gctttgggcc gaccggcttt tgaagatgcg atggagcggc 240 

ctcaacgcca tacagtttta tgtgccctgg aactaccacg agccacagcc tggggtctat 300 

aactttaatg gcagccggga cctcattgcc tttctgaatg aggcagctct agcgaacctg 3 60 

ttggtcatac tgagaccagg accttacatc tgtgcagagt gggagatggg gggtctccca 42 0 

tcctggttgc ttcgaaaacc tgaaattcat ctaagaacct cagatccagg tgagttgaga 480 

caaaggattt aacacagaag caagtaagta aaatgggcta tttgggtgcc aaaagcagaa 540 
gagaccattc ccaaattgga ggtcatcatt catttaccaa gtgtttcctt catgcccagc 600 

aggatgctag aaactggggg accagacaga cccccatcct tgtccagcag gcttatgata 660 
tggtcaatc 669 

<210> 82 
<211> 1606 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4881249CB1 



<400> 82 

gcggcggaga cggccggcgc ccggcgcgga 

gcctcgtcga gaaggatgct gtcccgaaag 

gaggtgcagg ggaagtacgt gaagaaggag 



gcctagggag gcagttcagc gcggcctcgg 60 
aaaaccaaaa acgaagtgtc caagccggcc 120 
acgtcgcctc tgcttcggaa tcttatgcct 180 



67/82 



WO 00/77040 



PCT/US00/16636 



tcattcatcc ggcatggtcc aacaattcca agacgaactg atatctgtct tccagattca 240 
agccctaatg ccttttcaac ttctggagat gtagtttcaa gaaaccagag tttccttaga 300 
actccaattc aaagaacacc tcatgaaata atgagaagag aaagcaacag attatctgca 3 60 
ccttcttatc ttgccagaag tctagcagat gtccctagag agtatggttc ttctcagtca 420 
tttgtaacgg aagttagttt tgctgttgaa aatggagact ctggttcccg atattattat 480 
tcagacaatt tttttgatgg tcagagaaag cggccacttg gagatcgtgc acatgaagac 540 
tacagatatt atgaatacaa ccatgatctc ttccaaagaa tgccacagaa tcaggggagg 600 
catgcttcag gtattgggag agttgctgct acatctttag gaaatttgac taaccatggt 660 
tctgaagatt taccccttcc tcctggctgg tctgtggact ggacaatgag agggagaaaa 720 
tattatatag atcataacac aaatacaact cactggagcc atcctcttga gcgagaagga 780 
cttcctcctg gatgggaacg agttgagtca tccgaatttg gaacctatta tgtagatcac 840 
acaaataaga aggcccaata caggcatccc tgtgctccta gtgtacctcg gtatgatcaa 900 
ccacctcctg tcacatacca gccacagcaa actgaaagaa atcagtccct tctggtacct 960 
gcaaatccat atcatactgc agaaattcct gactggcttc aggtttacgc acgagcccct 1020 
gtgaaatatg accacattct gaagtgggaa ctcttccagc tggctgacct ggatacatac 1080 
cagggaatgc taaagttgct cttcatgaaa gaattggagc agattgttaa aatgtatgaa 1140 
gcatacagac aagcccttct tacagagttg gaaaaccgaa agcagagaca gcagtggtat 1200 
gcccaacaac atggaaaaaa tttttgagct gattttttaa aaatttaagt tttgtaagag 1260 
ctttaaaata ttttcacaga taaaaaattg caaacaagta ctctggttaa taaatgctgc 132 0 
ttcctttgtg gaaattataa aattctaact ttacatgtat tttgttatta gaaattttct 1380 
tttattgaat gagaaaaatt agtctatcat tttaagagcc aatatggcaa acactttcaa 1440 
atactgtata ttaggaaact gttttggtat tcttgatgga aaaaaatgca gcggaaatgt 1500 
cattatgaac agatgttaaa taggaaatta ttacttgtta acttcttaca gcagtagtac 1560 
cttctttaaa aaaaaaaaga atctgcggta ttttttttaa aaaaaa 1606 



<210> 83 

<211> 1980 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 43187 1CB1 



<400> 83 

ggctgagcgt 

ggtaccatcc 

gtgggccgag 

tgggacagca 

cttctggcca 

ggggcactga 

tgcaggaggc 

agtgacgtgg 

gcacgtagtg 

gttctcaggc 

acaggccgcc 

tgccagctgt 

gtggcgtcta 

ccccgcctcc 

ggtgatcttt 

tgcttccgag 

gactacttcc 

gggggccccc 

tactacatgt 
gtcgccgaca 
atgctagacg 
cggcttgagg 
gaggacttcg 
gactctatcc 
agcgccatag 
ggtctggact 
aagcatgtgt 
atggggggtg 
gggaggatcc 
tgggggtgga 
ccttctccag 
gccctgaatt 



gtttacatcc 
tccgcgccat 
tgcggtacct 
cccccttgta 
atggagcccg 
gtgaccccat 
gggattacta 
tctttgtagt 
cctactttgc 
acccactgat 
tggacattgg 
gggacctgct 
agccaggcac 
gggaggacat 
gggagctgcc 
tggctggctg 
gagccctgct 
cagccgtcac 
acagcgacca 
tgtacctgct 
aggacactgt 
accagtgcac 
tggaggcggt 
cgctggtgga 
aggaggcgca 
gttgagcccc 
atgcgtttgt 
cgaggggctc 
atttgggatg 
caccactcag 
atccccccta 
gcttctctaa 



gccgggtgcg 
ggacaccagc 
gctggagcag 
ctatgcctgc 
ctgcgaggcc 
ccgccgggct 
tgacgacttc 
acacgggaag 
caacatgctg 
caaccccgtg 
cgtagagcat 
cagcgacctg 
gtgtgtgaag 
ggcgctgctg 
cttcccttgt 
cagcttcctc 
ggatgaccac 
cctgcatggc 
cactgagctg 
gccaggcctg 
ggtgggtgtg 
tgagtacatg 
gaaggaggag 
cgacatccgc 
gcagcgtctg 
tggctgggca 
gtgcagctct 
agtggggctt 
agccccctcc 
ggaaacctgg 
cccaccccag 
gctggtgttc 



cggcttcgcc 
gacctgttcg 
cgagacgtgg 
ttgtgtgggc 
aacaccttcg 
ctacgcgatt 
ttgcagcggc 
ccattccggg 
gacaccaaat 
gcctttgggg 
gtgagtgact 
gaggccaagt 
gtgctgacca 
gccgattgtg 
cctgacggct 
tgccacaagg 
ttccgagaga 
atctcacccg 
tcccccgagg 
aagaggctgt 
tggcgcgtgg 
gccaaggtca 
gcagcggctg 
ttccacgtgg 
cgggcactcg 
gccccagggg 
tcttcctgct 
ctcttccctc 
ccccaatgca 
ggtgggggtg 
tcccaaatcc 
ccatgcacag 



gcccgaggtc 
ccagctgcag 
aggtgaatgt 
acgaggagct 
atggtgagcg 
acaagcaggt 
ttctagagca 
tgcatcgctg 
ggaagggcaa 
ccctgctgca 
gtgagcgcct 
gcgagaaggt 
tcgagccccc 
ccctgccccc 
tcaacagctg 
cctttttctg 
gcgaggagcc 
acgtcttcac 
cagcctatga 
gcggccgcag 
ccaagctctt 
ttgagaagct 
tggcagcccg 
ccagcacggt 
aggacctgct 
ccaggagctc 
ccctgcacat 
catgagcctg 
caagccagcc 
ggctttggtc 
agtcctctgg 
ggccattcag 



gttcggctcg 
gaagggggat 
gcgggacaag 
ggtactctac 
ctgcctctat 
cacggcttcc 
gggcatccac 
cgtcctgggt 
gagtgtcgtg 
gtacctgtac 
ggccaagcaa 
gtctgagttt 
acctgcagac 
cgagctccga 
ccctgacatc 
tggccgcagt 
agcgacctca 
tcacgtgctc 
tgtgctgagc 
cctggctcag 
ccgcctggcg 
ggtggagcgg 
gcaggagacg 
gcagacctac 
cgtgtccatc 
tcttggagac 
tgagggcttc 
gagaccccag 
cccaagaccc 
ttagcacttt 
cccttgccta 
gaagggctgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



68/82 



WO 00/77040 



PCT/USOO/16636 



gggagtgtgt gtggcaataa agcttgaagg caccgtggga gcatgaaaaa aaaaaaaaaa 1980 

<210> 84 
<211> 1449 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 526155CB1 



<400> 84 

gcccgccata 

aacagctgcc 

caggtagcgg 

actcagtgca 

gctcctagcc 

ggcccctggt 

tgagagtgac 

tgactactta 

tcgacacctc 

gctggaggct 

gtatgctgac 

cagcatcggg 

atggtgtgtg 

ccaacacaag 

taaaaaaacc 

gcaacacctg 

gaaagcctca 

aaagaactgt 

agtcctcaga 

acctccccta 

tgtttttgtt 

atgctctctc 

gagtaggtgg 

ttctgccacc 

ttgaggagc 



gagtttagtg 
ggaggtgacg 
cccgcagagc 
gtgatgagtg 
aagtcggcca 
gtctacgtgt 
tttgcctcta 
gctgaagccc 
tcagttgtca 
cttgccctgc 
gtgcttcgtg 
cgggacatcc 
ggctgtgagg 
gagcagcagc 
attaaagtta 
actgagctga 
aagggcaagg 
cgtttcctcc 
gagccttctg 
accccaaaca 
ggtactttct 
ccttgtttcc 
gtagccctga 
ccatccctgc 



gccagagcga 
gagcggcggc 
ctgacccagg 
cggaagtgaa 
agggggcagc 
ttggagaact 
ccttccggct 
ggaatcttcc 
ccctggctgc 
gtaatgtgcg 
gctccctgga 
agcgccagga 
tcgtgctgtc 
tgggcctgaa 
cgacggcagc 
gggaaccagc 
ggctccgagg 
ctggggatgt 
tgcccctggc 
tagatcacac 
gttttttgtg 
ttaagagctc 

tgggggtcgc 

atgcctgatc 



ctcttcaggg 
cccgcccggt 
ctctggacat 
ggtgacaggg 
gctggccaca 
gctggacatg 
gctcacagtg 
tccactaaca 
taaagtaaag 
gcagctggaa 
ccagcgcaac 
cctcagtgcc 
aggcattgag 
gcagcagatt 
agcagccgca 
tcctggcacc 
gagcgccaag 

ggggtcccag 

cagctgataa 
cttctctagg 
acttcatgtg 
agcatctgtc 
tctgtctgga 
cccagttcct 



aggtggcagg 
gcgctggagg 
cctgagccca 
cagaaccagg 
ctcatccatc 
cccaatgtta 
tttgcttatg 
gaggctcaga 
tgtatcccat 
gaccttgtga 
cagcggctcg 
attgcccgaa 
gagcaggtga 
gagagtgagg 
gccacatctc 
aaccagcgcc 
atttggtcca 
ctgcctgcct 
tcctaggttc 
gaggagtcaa 
ttccattgct 
cctgttcatt 
gcataaccca 
attaccctac 



aaaggcttgg 
tcgaagcttc 
agtcccccac 
agcaatttct 
aggtgctgga 
gagagctggc 
ggacatacgc 
agaataagct 
atgcagtgtt 
ttgaggctgt 
aggttgacta 
ccctgcagga 
gccgtgccaa 
ttgccaacct 
aggaccctga 
agcccagcaa 
agtcgaattg 
gcctcttagg 
atgacccttc 
atgtaggtca 
ccccgctgcc 
acatgtcatt 
caggcgtttt 
ccctgaccta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1449 



<210> 85 
<211> 1231 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 676234CB1 



<400> 85 

cctcctgctg 

ggccgcctgg 

tgctgggcat 

ccctgtggaa 

agggcctgtg 

actcactgct 

tcctgctggc 

tggaggacga 

ccggcatcct 

acaaccccct 

gggcggcggc 

cccaggtcga 

catctggact 

ctccccactg 

acctccttcc 

gggtcagctg 

ttgggcagcc 

cttccgttgc 

gaaccctggc 



gacacagaga 
ggctgagaag 
gaccctggct 
ggtgaccgcc 
gatgtcctgc 
ggctctgccg 
cctgcttggc 
aggtgccaag 
ggtgctcatc 
ggtggctgag 
tgcactgctt 
gcggccccgc 
ggacaagagg 
ccccgccctt 
ccaggaaaac 
gcctggctga 
tagaggtgtt 
ccaggacact 
ctcaccaaaa 



cacccaccca 
acctaaccga 
gtgctgggct 
ttcatcggca 
gtggtgcaga 
caggacctgc 
ctcctggtgg 
gcccgtatcg 
cctgtgtgct 
gccctcaagc 
atgctgggcg 
ggacctcggc 
gactacgtgt 
tcgaccttgg 
ccactttcca 
gctcttctca 
gggaaccctg 
ggctggcctt 
cagcagcagc 



gcacaccaga 
ggggccagat 
ggctggggac 
acagcatcgt 
gcacgggcca 
aggccgcacg 
ccatcacagg 
tgctcaccgc 
ggacggcgca 
gggagctggg 

gggggctcct 

tgggctactc 
gaggcggagg 
cctgatgacc 
aaagcccaag 
gtggggtccc 
gcctgccccc 
ccttctcttc 
tcgttggctc 



cacaccctct 
ggcttcgacc 
cctggtgtcc 
ggtggcccag 
gatgcagtgc 
tgccctctgt 
tgcccagtgt 

gggggtcatc 

cgccatcatc 
ggcctccctc 
ctgctgcacg 
catcccctcc 
tttcccctgg 
agatgccctg 
ctacacctgg 
ctttgatgtt 
acctccccag 
tgagccctcc 
caaaaccagg 



gagtcaccta 
ggcttagaac 
tgcgccctgc 
gtggtgtggg 
aaggtgtacg 
gtcattgccc 
accacgtgtg 
ctcctcctcg 
caggacttct 
tacctgggct 
tgccccccgc 
cgctcgggtg 
gagcccactg 
ctccatcaca 
ctgcagggct 
ctcccccaag 
taattgtttc 
cctgccccag 
gagcagacca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



69/82 



WO 00/77040 



PCT/US00/16636 



tgccctccca accctggagt tgtcagggag ggcctgccca tcacctccct ctccccaaca 1200 
tccccaccct cgagttggaa ataaagagca t 1231 

<210> 86 

<211> 858 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 720145CB1 



<400> 86 

gggccgcggc 

ctggagctca 

gtctttctag 

gcccacccgt 

catataacaa 

actacacatc 

tgatactgta 

acttaaatcc 

aagaaaaaca 

ttgattcatg 

gatttatatt 

ttaatttcag 

tatattatta 

tatttttgtg 

gttgaaagta 



gaccccgggt 
cggtcggcgc 
aaaggcctag 
gaggagccag 
cagggggcaa 
tgccatagaa 
taggctggga 
tggatttcaa 
aagaagaaat 
atccttacat 
taagaaatta 
gttgaatggg 
tattaaatat 
atttttttat 
taataaac 



ggaggcgacg 
ttgtcataac 
gcgaggccta 
ctgccgcgac 
atcaagtact 
gtccaaccca 
tattttgatg 
gatgctactt 
gttgcaaaaa 
ctgcatctag 
atacattagc 
ttttttttaa 
tatagtaaaa 
tctcaacttt 



ttggggttgt 
ccagcactag 
ggacgagggc 
gcaaatgcag 
tcagggttga 
attttgaagt 
atgctttgga 
tgagcttaaa 
attattgata 
ttatcagtaa 
actgaaagtt 
tgaagtgtaa 
aggaatgtgt 
ttatttaaaa 



ttgggctcct 
gactgagcgg 

ggcggagaac 

ccctgaggat 
tttttatgaa 
tccatattac 
agatttcaag 
acagactatt 
tttttaactt 
tttagatatt 
aaatagtgtg 
ataataccaa 
ggtattttct 
aatgttacat 



atctttgcct 60 
gcggacgttc 120 
gcggggagaa 180 
ttggctactg 240 
gccatggatg 3 00 
aacagagggt 3 60 
aaggtcttag 420 
ctagacaaag 480 
aatggaagta 540 
gagctatttt 600 
tttaaggtag 660 
tgtataagtg 720 
tcagcaaaac 780 
ctgcagaaaa 840 
858 



<210> 87 

<211> 1748 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1001951CB1 



<220> 

<221> unsure 
<222> 65 
<223> a, t, c, 



g, or other 



<400> 87 

cgcatgcccc 

ggggnggggc 

gcagttcttt 

cctcactgca 

tctcttacag 

caatgttgca 

cgccttgctg 

gctgcaccat 

aaggagagag 

caaaccttca 

gctgtgagaa 

tccccccatg 

gaggaagctc 

ctacaaatgt 

tgccttcatt 

ctgtctactc 

gcaggactgg 

gcagcagtgg 

ttatttatcc 

tggctcctca 

gatctgattg 

ctcacagttc 

gggcccctgc 



gtgcttcgat 
aaatgcagct 
ttccagaaac 
aacagtccta 
ctgttgccag 
gaaaagggtc 
cacacactag 
caagcatggt 
agggggctct 
tccggagctg 
ggcagctcct 
tggctgtgct 
cagagaagat 
tgcaacctgg 
acatcatctg 
tggggttgct 
agctgctggc 
agactgtggg 
ttctgcagtt 
acaacctcac 
agcctctctt 
tgtgcaatgt 
ttcccgcctt 



gtctaagcaa 
cagagatgag 
agcccagtct 
gtttctgtac 
tatctaacgt 
ctgcttactg 
ccttttctga 
gcagcagttc 
ggtcagcctt 
tgtctgtata 
gccacagggc 
ggaagctctc 
cattccctcc 
cccaaagctc 
cagccaggtc 
gctgttggac 
atgccccgtg 
agggcaaatg 
ctttttccag 
tgcaaacagt 
acagctgttg 
tgcagaaaag 
gctgcacaca 



cctgctaact 
cgttgttgtg 
gctccctgag 
ctccttgctc 
ggtgagcgta 
ccagcggctg 
cactgaagta 
ctgcggcaag 
cgtcgaggct 
cactgggtaa 
attgttccag 
ggatatgcct 
atcttggcct 
aaccctgggg 
agcaatcctc 
ttggctgggg 
cttcgatgtc 
cagctcagag 
aaacagccca 
cctagtttct 
ccagtatcta 
ggtcctgctt 
ctagcctttt 



gaggcagcag 
gcagccttat 
ggcctttggc 
tccctggatc 
atggtgctca 
tggccagggc 
gtaggccaga 
cccagcgggg 
tgcagcaccc 
ccctgatcgt 
ccttggctgc 
tgtcccagct 
ccactctccc 
tcgctgtgga 
tgctcattgg 
ctgtccagaa 
taagcaacct 
atgagcgtgt 
gtctgctccc 
gtacctcctt 
acgtggtgag 
actgccagcg 
ctgacactga 



tggagactgt 
ttatccttct 
tcctcaacaa 
tgattgagcc 
cagttctgtg 
ccctgcttcc 
gtttggaact 
gacagaggaa 
tgaaacacag 
ggagagtgag 
ctgcatccag 
tctacaggct 
tcagcacatg 
gtttgcctgg 
ccatggggct 
aaccgaggat 
gctaactgag 
tgtggcagcc 
tgagggcctt 
gctctccctg 
cgtaatggtg 
gctgtggcca 
agtagtaggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



70/82 



WO 00/77040 



cagagtttgg 
ctgcagcagt 
cgtgtgtatg 
attcccctct 
ccataatgag 
tcctccccct 
tctactta 



agctgctgca 
cagggctgca 
ctctccagca 
ctcttaacat 
gtctcatgtt 
tccactcagc 



t^ffjctgttc 

agcctggaaa 
gacagctctt 
caagcttgtt 
ctctgctccc 
actatccagg 



ctgtatcagc 
aggcatcagg 
caagggtgat 
ttgtccagta 
acacctaagc 
caaggaggac 



PCTAJS00/16636 



cagaggctgt TCaggtcttc 1440 
aagaggccca gctccaggat 1500 
cttgtttctc aatgtcactc 1560 
gagcctttgg agatttagga 1620 
caagaccttt gggtcccagc 1680 
caaaaaggga ctcagtgtgg 1740 

1748 



<210> 88 
<211> 4240 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1243349CB1 



<400> 88 

agctttgcca cagaaagcag tgagcaagaa tgatagctgt ctcttttaaa tgccgttgtc 60 
aaattctgag gcgacttact aaagatgaga gtccctacac taaatccgcc agccagacaa 120 
agccgcctga tggagcgttg gctgtgagga gacagagcat cccagaggaa ttcaagggct 180 
ccacagtcgt cgagctgatg aagaaggaag gcactaccct gggtctgacg gtatcgggag 240 
gaattgataa ggatggcaag ccaagagtat ctaatctgcg gcaaggagga attgctgcta 3 00 
gaagtgacca gctggatgtg ggtgactaca tcaaagcagt gaatggaatc aacctggcca 3 60 
aattccgcca tgacgagatc atcagcttgc tgaagaatgt gggagaaaga gtggttcttg 420 
aagtagagta cgagcttcca ccggtctctg tgcaaggatc aagtgttatt ttccgaacag 480 
tggaggtcac attacataaa gaaggcaata cctttggttt tgtaattcga gggggagcac 540 
atgatgatag aaataaatct cgtccagttg tgataacatg tgttcgtcct ggagggcctg 600 
ctgacagaga gggcacgatc aaacccggtg acaggttgct cagtgtggat ggaattcggc 660 
ttcttggaac tacgcatgct gaagccatga gtattcttaa acaatgtgga caagaagcag 720 
cactgctgat agaatatgat gtctcagtaa tggactctgt ggcaacagca tccgggccac 780 
tactagtcga agttgccaaa actcctggtg ccagccttgg ggttgcccta actacctcga 840 
tgtgctgtaa caaacaagtc attgtcatag acaaaatcaa atctgcaagt attgcagaca 900 
gatgtggcgc attgcatgtg ggagatcaca tcctctccat cgatggaacc agcatggagt 960 
actgtacact tgcagaagca acccagttcc tggccaacac cactgaccag gtcaagcttg 102 0 
agatccttcc ccatcatcag acccggctgg ccctaaaggg gcccgaccat gtgaaaattc 1080 
agaggagcga caggcaactt acctgggatt cctgggccag caaccacagc agccttcaca 1140 
ccaaccatca ctataacacg taccaccctg accattgcag agtaccagcc ctgacattcc 1200 
cgaaagcacc tcctccaaac agccctccag ctttggtgtc ttcatccttc tctcctacct 1260 
ccatgagtgc atacagcctg agttccctga acatggggac tctacctcga agcctctact 1320 
ccaccagccc acgtggaacc atgatgagga ggagactgaa aaagaaagac ttcaaaagct 13 80 
cattgtcctt agcctccagc acagtaggat tggctgggca ggttgttcac acagaaacca 1440 
cagaggttgt tctgacggca gatcctgtca caggatttgg gatccaactg cagggcagtg 1500 
tgtttgccac agaaactctc tcttctccac ctctgatttc ctatatcgaa gctgacagcc 1560 
cagcagagag atgtggggtg ctacagattg gagacagagt gatggccatc aatggaattc 1620 
caacagaaga cagcaccttc gaagaagcca gtcagctcct ccgagactct tcaatcacga 1680 
gcaaggtcac actggaaatc gagtttgatg ttgcagagtc tgtcatccca agtagtggaa 1740 
catttcatgt aaagctgcct aagaagcaca atgtggaact tggaataacc ataagttcac 1800 
catccagtag aaaaccagga gaccccctcg tcatttcaga tatcaagaaa gggagtgtgg 1860 
cacacagaac tgggaccctg gaacttgggg ataaattgct cgcaatagat aatatccggc 1920 
tggacaactg ttccatggaa gatgcagttc agatcctcca gcaatgtgaa gacctggtga 1980 
agctcaaaat ccgcaaagat gaagataatt cagatgagca agaaagttcc ggagcaatta 2040 
tttacaccgt ggagcttaaa cgctacgggg ggccccttgg catcacaatt tcaggaactg 2100 
aagagccgtt tgatcctata atcatttcaa gcctcactaa agggggatta gctgaaagaa 2160 
ctggcgcaat ccacatagga gaccgaatcc tagccatcaa tagcagcagc ttgaaaggga 2220 
agcctctgag tgaagccatc catttgttac agatggcagg agagactgtc accttgaaaa 2280 
ttaagaaaca gacagatgcc cagtcagcat cgagccccaa gaagttccct atttctagcc 2340 
atttgagtga cctgggggat gtggaggagg actcctcacc agcacagaag ccaggcaagc 2400 
tctccgacat gtacccctcc acggtgccca gtgtggacag tgctgtggat tcatgggatg 2460 
ggtctgcaat agacaccagc tatggaactg aaggcactag ttttcaggcc tcaggataca 2520 
atttcaacac ctatgactgg aggagtccaa aacagagagg cagcttgtcc ccagtcacta 2580 
agcctcgaag ccagacttac ccagatgtgg ggctgagtta tgaagactgg gaccggtcca 2640 
cagccagtgg ttttgcaggg gctgccgata gtgcagagac agaacaagag gagaacttct 2700 
ggtctcaagc gctggaggat ttggaaacct gcggacagtc aggaattctg agagaactgg 2760 
aggcaacaat catgtcgggg agcacgatga gtttgaatca tgaggctcca acacctcgca 2820 
gtcagctggg gcgacaggcc agcttccagg agcgcagcag ctcgcggccg cactacagcc 2880 
aaacaactcg gagcaacacc ctgccttcag atgtgggtag gaagtcagta accctgagaa 2 940 



71/82 



WO 00/77040 



aaatgaaaca 
ccttgtacaa 
agaaaggagt 
agccctatga 
ttgttgtgcc 
cactggcttc 
acagtgcttt 
atacattata 
aaatccttta 
tgcaagcaca 
tgaagaggtt 
caacggtaga 
tttacaagtt 
ctgtgatgat 
agaacttggc 
tggtaaactg 
aaatgtttca 
acgtatagga 
aggatatgtc 
tgataatgaa 
attcaagact 
taggcaaacc 



agaaataaag 
ggactctgac 
gtatgtcaaa 
caggctctta 
cctcatagca 
acagaagtct 
tttccagcag 
gcaacgcttt 
agatttctgt 
ggggtctttt 
tcttttttgg 
gaaaaatagg 
catcttacta 
ttccttaatt 
caacaaagct 
ctgggactac 
gcaatgatgc 
atgattttaa 
ttgtgacctg 
ttagagtgtg 
aagaacttca 
aaactcatac 



atggaggact 
aatattcgcc 
caggtgaatc 
gaatccggga 
atagaccaac 
cctagccacg 
ttataaagca 
gacctttgaa 
aaatctctct 
tgaccactat 
ccgtccccca 
tcacttttta 
cactttgaaa 
tttcttctca 
cacaatggga 
acgtaggaga 
ctaagacgta 
tttttttttt 
tagtaaacat 
gagaatgaag 
tgcttgacat 



PCT/US00/16636 



ctccaactcc tgtggagctg cacaaggtga 
ttgggttcag tgtagcagat ggcttactgg 
cagctgggcc aggagatctt ggtggcttaa 
atgtccgaac cagagacttt gactgctgcc 
ataagctgga cctggttatt agtagaaacc 
agagtctacc aggagattgg agtgaacaga 
gtggtaattt ggagacacga gaacccacta 
ggacaaaaga caatatctac atggtgctaa 
gcacagataa tcaatcaacg tggcattaac 
catggctcat gttcacttcc cttttcaagt 
ggtatatggt gggcaatgcc ctgccagtcc 
caactctaca attaacatca gaggaaattt 
aaaagagaaa catctgtttg aaaatattct 
tcagtttctt actatgaagt cattaatgta 
taggctggct ctactagggg aactagtgtt 
ggggtacagg tataaaatta agttatctta 
ccataatagg tggtggtaaa tgttttgccc 
tgctattccc tatgcaacaa attatcaaac 
aaggacacat ttttaatagc tgaaaatctc 
gagaattagt tatattatct tatttttaaa 
agtctattaa aatgaggttc atcttaatga 
gtztttgaaaa 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4240 



<210> 89 
<211> 2317 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



1338201CB1 



<400> 89 

ataaccctca 

tgaaccggct 

gaaatttaat 

gaaaagccta 

caactgaact 

gtcttgatga 

tagaacttga 

ataaattaga 

aaggaagccg 

cttctcgtca 

aagagaagag 

attattatca 

cctgtgttga 

ccccagcctc 

atgtggttag 

cttcaggcag 

ccccgaattc 

gatcagtttc 

tcccgcacac 

cgctgctcat 

cgaggggttg 

tgaccgtgcc 

agaatagcag 

ctgccgacag 

agcccgagac 

gagaaaaccc 

ccatcattcg 

gaatgatggg 

ctaactctaa 

acaagtttct 

atattttctt 

tttaaatgaa 

tggctttctg 

tagcattatg 



ctaaagggaa 
cacggagagc 
aaacctgggg 
ctacgatgga 
gggacatgtc 
aaattttaaa 
cgtgaaatat 
gtctttggag 
aaacgcactc 
gagtgaaatc 
gcgcttctgc 
cttacagtct 
tgccatcaaa 
tacccccgtg 
gaaagattac 
agcatatacc 
acaaagggta 
ggttgcaacg 
tgcgggctcc 
ccccgaggag 
gttcccgtcg 
cacgccaagc 
tgttgtcatc 
gagagcagat 
cgcggctcct 
ctttgccact 
atgagaggac 
cgcatcctgt 
tgtcgcctgc 
gcctttaaga 
cctattctgt 
ttttttcctt 
cacctgatat 
tctaaagaag 



taagcttgcg 
acctaccgga 
aaaaattatg 
gtggccaaga 
ctcatagaga 
aaattccaca 
atgaacgcaa 
aaatcccaag 
aaatatgaac 
cagaaattca 
tttctggttg 
gcagaactac 
gtgccagaga 
tctggaactc 
gacacccttt 
agtcccttga 
aataattcaa 
ggactgaaca 
aacaagacct 
aaggatggct 
tcgtacacga 
cccacaccag 
cccccacccg 
tcggccagga 
aacgatgcca 
gtgaaactcc 
agccaaggac 
ctgccacgtg 
ttaagcaaat 
taaatgagta 
tcaagaaaca 
aataacagcc 
gaatgagacc 
atatcacgta 



ccgcagccat 
atgttatgga 
agaaagctgt 
tcggtgagat 
tttcaagtac 
aagagattat 
ctctaaaaag 
ctgagttgaa 
acaaagaaat 
ttgcagatgg 
ataagcactg 
tgaattccaa 
aaatcatgaa 
ctcaggcttc 
ctaaatgctc 
tcgatatgtt 
caggtacttc 
tgatgaagaa 
tactcagctt 
ggctctatgg 
agttgctgga 
tgagaagcat 
actacttgga 
cgacatccac 
acgggactgc 
gcccgactgt 
tctcccgggc 
ctgacggtcg 
catgcttctc 
atagtctaat 
gtaaacttgg 
agaataaggg 
agttttattt 
agtttgcatc 



gtcccggggg 
acagttcaat 
aaacgctatg 
tgccactggg 
ccacaagaaa 
ccatgagctg 
ataccaaaca 
gaagatcaga 
tgagtatgtg 
t tgcaaagag 
tggctttgca 
gctgcctcgg 
tatgatcgaa 
acccatgatc 
accaaagatg 
taataaccca 
cgaagatccc 
gcagaaagtg 
tgcacaggga 
agaacacgac 
agaaaatgag 
cagcaccgtg 
atgcttatcc 
ctttaaggcc 
aaagccgcct 
gacgaatgat 
ctctccggtt 
ggaagcttca 
tgtttcacgt 
gaccagctca 
tttcaatctt 
atagtctatg 
tataaagcat 
ttagcatgca 



cccgaggagg 
cctgggctgc 
atcctggcag 
tcccccgtgt 
ctcaacgaga 
gagaagaaga 
gaacacaaga 
aggaaaagcc 
gagaccgtta 
gctctgcttg 
aaccacatac 
tggcaggaga 
gaaataaaga 
gagagaagca 
ccccccgctc 
gccacggctg 
agtttacagc 
aagaccatct 
gatgtcatca 
gtgtccaagg 
acagaagcag 
aacttgtctg 
atgggggcag 
ccagcgtcca 
tttctcagcg 
cgctcggcac 
ctcccttgcg 
gtggagaggc 
agttgggttg 
gccatttaaa 
tactgtattt 
ctttcaggac 
gtgctcttaa 
aatcataatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 



72/82 



WO 00/77040 ^ PCT/US00/16636 



afifeactata taaatgtaat ttaacttaaa^^l 



ttaagcaata taaattatga alBtactata taaatgtaat ttaacttaaa atgtttaagt 2100 

gtagagcttc cagagatggg ggaaaccccc accctccctc caaccacgcc agagctgtag 2160 

gagtgctaag acgctttggc tgcccttatc acagcccacg tagcatcgga ggaacctctc 2220 

cgggagctcc tttcctggga gtttggagtt gccacgggtt cggttgaaag ggtcatccgg 2280 

gaccagcctg ggggtggagt tcagggcctt ggtttct 2317 

<210> 90 
<211> 3899 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No: 1405141CB1 



<400> 90 

cggcacgagc tcgtgccgaa ttcggcacga gaaaccagct gttccgagtc ctcaccagag 60 
ccacacagga agaagtcgcg gttaacgtga ctcgggtcat tattcatgtg gttgcccagt 120 
gccatgagga aggattggag agccacttga ggtcatatgt taagtacgcg tataaggctg 180 
agccatatgt tgcctctgaa tacaagacag tgcatgaaga actgaccaaa tccatgacca 240 
cgattctcaa gccttctgcc gatttcctca ccagcaacaa actactgaag tactcatggt 300 
ttttctttga tgtactgatc aaatctatgg ctcagcattt gatagagaac tccaaagtta 360 
agttgctgcg aaaccagaga tttcctgcat cctatcatca tgcagtggaa accgttgtaa 420 
atatgctgat gccacacatc actcagaagt ttcgagataa tccagaggca tctaagaacg 480 
cgaatcatag ccttgctgtc ttcatcaaga gatgtttcac cttcatggac aggggctttg 540 
tcttcaagca gatcaacaac tacattagct gttttgctcc tggagaccca aagaccctct 600 
ttgaatacaa gtttgaattt ctccgtgtag tgtgcaacca tgaacattat attccgttga 660 
acttaccaat gccatttgga aaaggcagga ttcaaagata ccaagacctc cagcttgact 72 0 
actcattaac agatgagttc tgcagaaacc acttcttggt gggactgtta ctgagggagg 780 
tggggacagc cctccaggag ttccgggagg tccgtctgat cgccatcagt gtgctcaaga 840 
acctgctgat aaagcattct tttgatgaca gatatgcttc aaggagccat caggcaagga 900 
tagccaccct ctacctgcct ctgtttggtc tgctgattga aaacgtccag cggatcaatg 960 
tgagggatgt gtcacccttc cctgtgaacg cgggcatgac tgtgaaggat gaatccctgg 1020 
ctctaccagc tgtgaatccg ctggtgacgc cgcagaaggg aagcaccctg gacaacagcc 1080 
tgcacaagga cctgctgggc gccatctccg gcattgcttc tccatataca acctcaactc 1140 
caaacatcaa cagtgtgaga aatgctgatt cgagaggatc tctcataagc acagattcgg 1200 
gtaacagcct tccagaaagg aatagtgaga agagcaattc cctggataag caccaacaaa 1260 
gtagcacatt gggaaattcc gtggttcgct gtgataaact tgaccagtct gagattaaga 1320 
gcctactgat gtgtttcctc tacatcttaa agagcatgtc tgatgatgct ttgtttacat 1380 
attggaacaa ggcttcaaca tctgaactta tggatttttt tacaatatct gaagtctgcc 1440 
tgcaccagtt ccagtacatg gggaagcgat acatagccag tgtgagaaag atatcaagtg 1500 
tgcttggaat ttctgtagac aatggctatg gccactcgga cgcagatgtt ctgcaccagt 1560 
cattacttga agccaacatt gctactgagg tttgcctgac agctctggac acgctttctc 1620 
tatttacatt ggcgtttaag aaccagctcc tggccgacca tggacataat cctctcatga 1680 
aaaaagtttt tgatgtctac ctgtgttttc ttcaaaaaca tcagtctgaa acggctttaa 1740 
aaaatgtctt cactgcctta aggtccttaa tttataagtt tccctcaaca ttctatgaag 1800 
ggagagcgga catgtgtgcg gctctgtgtt acgagattct caagtgctgt aactccaagc 1860 
tgagctccat caggacggag gcctcccagc tgctctactt cctgatgagg aacaactttg 1920 
attacactgg aaagaagtcc tttgtccgga cacatttgca agtcatcata tctgtcagcc 1980 
agctgatagc agacgttgtt ggcattgggg gaaccagatt ccagcagtcc ctgtccatca 2040 
tcaacaactg tgccaacagt gaccggctta ttaagcacac cagcttctcc tctgatgtga 2100 
aggacttaac caaaaggata cgcacggtgc taatggccac cgcccagatg aaggagcatg 2160 
agaacgaccc agagatgctg gtggacctcc agtacagcct ggccaaatcc tatgccagca 2220 
cgcccgagct caggaagacg tggctcgaca gcatggccag gatccatgtc aaaaatggcg 2280 
atctctcaga ggcagcaatg tgctatgtcc acgtaacagc cctagtggca gaatatctca 2340 
cacggaaagg cgtgtttaga caaggatgca ccgccttcag ggtcattacc ccaaacatcg 2400 
acgaggaggc ctccatgatg gaagacgtgg ggatgcagga tgtccatttc aacgaggatg 2460 
tgctgatgga gctccttgag cagtgcgcag atggactctg gaaagccgag cgctacgagc 2520 
tcatcgccga catctacaaa cttatcatcc ccatttatga gaagcggagg gattttgaga 2580 
ggctggccca tctgtatgac acgctgcacc gggcctacag caaagtgacc gaggtcatgc 2640 
actcgggccg cagcgttctg gggacctact tccgggtagc cttcttcggg cagggattct 2700 
ttgaagatga agatggaaag gagtatattt acaaggaacc caaactcaca ccgctgtcgg 2760 
aaatttctca gagactcctt aaactgtact cggataaatt tggttctgaa aatgtcaaaa 2820 
tgatacagga ttctggcaag gtcaacccta aggatctgga ttctaagtat gcatacatcc 2880 
aggtgactca cgtcatcccc ttctttgacg aaaaagagtt gcaagaaagg aaaacagagt 2940 
ttgagagatc ccacaacatc cgccgcttca tgtttgagat gccatttacg cagaccggga 3000 
agaggcaggg cggggtggaa gagcagtgca aacggcgcac catcctgaca gccatacact 3 060 



73/82 



WO 00/77040 
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gcttccctta 
ccatcgaggt 
cctcggccga 
aggtcaatgc 
gatatcctga 
gcggtcaagc 
aagaaatgaa 
agate tgccc 
gecatcagtg 
gtgtgattac 
tgctttccaa 
agaaaggaaa 
gtgeacatat 
aggtgtggta 



tgtgaagaag 
ggecattgae 
ggtggacatg 
tggcccacta 
caataaagtg 
ettageggta 
agccaactac 
cctggaggac 
ggactccaac 
atctcatggc 
agccaatcac 
taaagaacaa 
ttttttaaat 
gacactcttg 



cgcatccctg 
gagatgagta 
ateaaactge 
geatatgege 
aagctgetta 
aacgaaegtc 
agggaaatgg 
gaagacgagc 
aagcacaatg 
ccgtgtgtgg 
tggggagacc 
cgttatttct 
ctcactggca 
agctggactt 



tcatgtacca 
agaaggtggc 
agctcaaact 
gagctttctt 
aggaagtttt 
tgattaaaga 
cgaaggagct 
gtcttaccga 
gttcacggga 
ggacttgett 
gagcacaggg 
taacagactt 
atattcaaag 
agattttatt 



gcaccacact 
ggagctcegg 
ccagggcagc 
agatgataca 
caggcaattt 
agaccagctc 
ttctgaaatc 
attcccttca 
tgaccagctc 
tgtcatttgc 
aggaccaagg 
tctataggag 
ttttcattgt 
cttccttgca 



gacctgaacc 
cagctgtgct 
gtgagtgttc 
aacacaaagc 
gtggaagctt 
gagtatcagg 
atgeatgage 
catcttcaac 
gtcttcggtc 
aaactcagga 
ggaaggggag 
ttgtaagaag 
gtcttaacaa 
gagtagtgt 



<210> 91 

<211> 2301 

<212> DNA 

<213> Homo sapiens 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3899 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



1686305CB1 



<400> 91 

accgggctcg 

tggtcagacc 

aaatgcagaa 

acaaaaggat 

tcgaaatggg 

cagagaggaa 

tgatgtggac 

tcaagaatca 

aaaagaaaag 

cacaaaaggc 

atattttaga 

gaatagaagt 

attagaagag 

agcaacaaat 

aaaggaagct 

acagaataag 

gaaatatgaa 

ccttcttcca 

aaaagactgc 

aggaactgea 

tttacttctg 

attaattgag 

ggtaaaaccc 

tattgaagaa 

tgctccagag 

taagaagaat 

attgaaaata 

ggatgtatgt 

gtttgecaca 

cagacaattg 

tttgtatcct 

aattctgeat 

aagactttct 

aaagattgea 

cgaagaactc 

gactttcata 

tatagcagga 

taactagtga 

tgagatacaa 



gcgtgagtcg 
ataatgactt 
gaattacaag 
atggaactaa 
aattttagga 
aacacaaaaa 
cgtatccttg 
gagteggaag 
ggcaataaat 
atggatgccg 
ctgaaaaaat 
tatacaaagg 
gecaaaaaag 
gaactcagga 
gacatagtga 
cagcaggcca 
agagcaattg 
gctaacagag 
acacaagcca 
agaacatttt 
gaacctggaa 
aaaggacact 
attgataatc 
actggtaatt 
aataatccta 
teaagecaag 
gaagaagtca 
cagtcttaca 
actgttcttc 
aaaagttctc 
aagttgtttc 
gacttttaca 
gaactaaaaa 
cgtgcattat 
aagaaaagat 
tgtaaatttt 
aaggactatc 
gaattgtatt 
aaaaaaaaaa 



ctgegggget 
cagcaaataa 
actttatgeg 
gaagacagaa 
aaa agaagaa 
acaggataaa 
atgagcttga 
aagatgggat 
acttcaaaca 
atccatataa 
ttgctgttgc 
cttattccag 
attatgaaag 
aaatcagtca 
ttaagtcaac 
tttcagagaa 
aatgetatae 
etatggecta 
ttttattaga 
tgggaaagct 
ataagcaagc 
gggatgatgt 
caccgcatcc 
tgatacagac 
ttaatctagc 
atgacctttt 
gtgatacttc 
gcgagaaaat 
ctccaattcc 
cagatatgtt 
agaaaaatct 
ttgagaaaga 
ggtttgatat 
ttaatcacat 
acggtggttg 
ttctactgaa 
tttgaacata 
caagtgaact 



gacggggtgg 
agcaatcgaa 
ggatttagaa 
tggtgttcct 
aggcaaagct 
atcttatgat 
caaagacgat 
tcatgtagat 
aggaaaatat 
tcccgtgttg 
tgagtctgat 
aegaggtget 
agtattagaa 
ggctttagca 
agaaggagag 
agateggggg 
tcgagggata 
tctgaagatt 
tggctcatat 
aaatgaggca 
agtaactgaa 
ctttcttgat 
tggatcaact 
tattgatgtg 
aaatgtaata 
tcccacaagt 
atccctgcaa 
geccatagag 
tgeaaacteg 
gtatcagtat 
ggatccagat 
aaagecatta 
ggcagtgatg 
agacaagtca 
atttccattt 
agtgttttgc 
agttaattaa 
ctgtttttct 



cagtgcggcg 
ttacaactac 
aactgggaaa 
gaagagaatt 
aaagagtctt 
tatgaggcat 
agtacccatg 
tcacaaaagg 
gatgaagcaa 
ccaacgaaca 
tgtaatttag 
gctcgatttg 
ctagaaccaa 
tccaaagaaa 
cgaaagcaaa 
aatggatttt 
gcagcagatg 
cagaaatatg 
tctaaagctt 
aaacaagatt 
ctctccaaaa 
tccacacaaa 
aaaccactca 
ccagatagca 
gcagccacag 
gatactccaa 
cctcaagcca 
atagaacaaa 
ttccagctcg 
ttaaagcaaa 
gtattcaacc 
ctcatctttg 
tttatgtcag 
ggattgaagg 
ttgctgaaat 
tttttaagaa 
ctataaggtg 
gaaaataaaa 



ggttacggcc 
aagtgaaaca 
aagacattaa 
tacctcctat 
ccaaaaaaac 
gggcaaaact 
agtctctgtc 
ctcttgtttt 
ttgactgeta 
gagegtcage 
cagttgeett 
etttgeaaaa 
ataactttga 
actcatatcc 
ttgaagcaca 
tcaaagaggg 
gtgctaatgc 
aagaagctga 
ttgccagaag 
ttgaaactgt 
t taaaaagga 
gacaaaatgt 
agaaggttat 
ctactgctgc 
gcaccacaag 
gagcaaaagt 
gtttgaagca 
aacctgctca 
aatctgattt 
ttgaaccatc 
agatcgttaa 
aaatcttaca 
aaacagagaa 
atagttctgt 
aattgttttt 
aatgaaatta 
aattgtgatt 
atataaacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2301 



<210> 92 
<211> 2314 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1688972CB1 

<400> 92 

ggccctgggg aggccctgag gatagaggag gaggcctggc cgtgcctttc tggagttcac 60 
agacccctaa agacaggagg cagggatggc gtcgtaggcc tggagagcgc ctggggggcg 120 
ggagggcctg aggctgtgct taaggcccag aggtcctgga gcagggcatg tcttgtcctg 180 
aggcttaggc agggccccca ggggagttgg ggcaggaggt ggggaggaca ggatcgtgga 240 
ggcttcgcat gtcatggtgt ctgttggttt ttaggctttc ttgggaagtc tggtgtttcc 3 00 
tatgtagttt ggggactagg ccacacctct ccctcatgct tgtcaccact gctacacctt 3 60 
gtctgcccac tttctctcct tcccgcccct ggaggtgaga ccctcacctc tctcgtgagc 420 
atgctgcctc atgtcttccg tgccgtcgtt ctgtggctgg gtgactgtcc ctcaggccgc 480 
caaccctagt tggctctcac agcaacttcc agtgctcctg gacggggact tcctaagcag 540 
catgggggcc acacgggccg gccacgcaca ctggcctggg tccgcattcc ctatttcagg 600 
ctgttgtctc gggtcagatc tccggaagcg gcaggagcgc tgggtctcct ggtcgtggcc 660 
gggcagcgtc tttcttcttc agggtctgga tgccacttcc cgcctcctcc ctgtgcccag 720 
tggaaggaga aacatcctag agggtggaga ctgccctgtt tttcttttgc cattgagtcc 780 
cagtgtgact caagcacgag gccacctttc tctgccagag gacagaaagc ttcagtgact 840 
ggcccagggt cacacagttc tgccgcccag cccagcagcc tgagccagca gtcaggatgg 900 
ggtggggaca ctccgtctct cttgagaccc cactcagaga atggggctga cactgtagga 960 
gaaatataag caggactcct gctggctgct cccctcctgc atgaggcatg ttagagggac 1020 
tgggaccgcc agcccccagc tcccatgccg ggcctcggcc cagtctcagc tcctgtttgg 1080 
gagggggggc agaggggcag gggggcagaa gttgagtctg ggagctgggg cagggcacag 1140 
gctctagggt ccagctgtcc tcctctcccc tcccctgctg gccactcctg ctgccgtctc 1200 
cacagctctg gaatgatgaa gtggacaagg ccgagcagga gctccgcgtg gcccagacag 1260 
agtttgaccg gcaagcagaa gtgacccgtc tcttgctgga gggaatcagt agcactcacg 1320 
tgaaccacct gcgctgcctc cacgagttcg tcaagtctca gacaacctac tacgcacagt 1380 
gctaccgcca catgctggac ■ ttgcagaagc agctgggcag atttcccggc accttcgtgg 1440 
gcaccacaga gcccgcctcc ccacccctga gcagcacctc acccaccact gctgcggcca 1500 
ctatgcctgt ggtgccctct gtggccagcc tggcccctcc gggggaggcc tcgctctgcc 1560 
tggaagaggt ggccccccct gccagtggga cccgcaaagc tcgggtgctc tatgactacg 1620 
aggcagccga cagcagtgag ctggccctgc tggctgatga gctcatcact gtctacagcc 1680 
tgcctggcat ggaccctgac tggctcattg gcgagagagg caacaagaag ggcaaggtcc 1740 
ctgtcaccta cttggaactg ctcagctagg caggtgcccc catccccccc gcattctggc 1800 
ctaggcagga gaggatgggc gcagcctgcc acttaacttg tttgttggtg acacagttgt 1860 
tcagagtggg gagaattcac cccattctgt ccctgcccct agtcacctag ctgtgagggt 192 0 
gcctgaggct gaatggctcc acccctcccc cagccctgct tctgacctgt ggctctggag 1980 
cccctgcccc tgcctgcatc cccgagcacc ccaccctcca ggctccacta aggagggagg 2040 
ggctgtctgc agcagctgca ctcagcacct aggccagggt ggggccgccg cagatgggct 2100 
caggaagccc caggtgcact cagcgagagc cctgcctttc agttgccaaa agctgcatca 2160 
ggggaatgcg gcaaggcaca cagggctctg gcagcccctg gggactgggc gctgcccctg 2220 
ggaggggaga gcctggccag ggctggtgtt gggcccggag cagcatcttc cggtgctatc 2280 
ctcccctccc acccctcaca gctcaagcca agtc 2314 

<210> 93 
<211> 1880 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1812494CB1 

<400> 93 

gaagaccctc gggaagcagt cagactgcac tcacctcaga cacactggaa aacagtaaga 60 
gaggagagaa agaagcctac tgaggaagaa ataagaaaga tctgcaggga tgaaaaggaa 120 
gcgctggggc agaatgagga atctcccaaa cagggtttga ttactatctg gcaactgtca 180 
gatctgagtt tctgtcctaa aaatgcactg gcaaattccc tactctcata actgacccat 240 
ttccttattt aaacacacac ccacagtgtc tatcttagca attactgaga catgtttagt 3 00 
gacctttcca aatctattat ctcttttgtt atggatgttt ccccagagct aagaaaatct 3 60 
ttgtgtggag cagatgcgtg gatgtgttgc atggatctca catagtttat ctttgattgg 42 0 
atgctgtggt ttgaggctga gggctgcttc catgctttcc tctattgtgc ggcaccctaa 480 
ggctactcat acggagagtc cagggacttg ttcttgtctg ggctgagagc ctcatttctg 540 
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catgtgatat 
acaatttcat 
gatggatttc 
caggtagagt 
gtattgccca 
gctggggcac 
tcccaggcag 
ccagagctct 
ctgcagctag 
ggctccggtg 
cgtctgctgc 
tatactgttg 
ccagatgcct 
gcacggcagg 
cagcagaggc 
ctcagtgacc 
gcccctacct 
ggggcatccc 
cgttgcctcc 
ctacctgggg 
ttccccgtga 
caaagacagg 
agtttggcaa 



aatggactca 
ttcaggcttt 
acatgtcagg 
tggagctagt 
agcggaggag 
atcggactct 
ttgctgcccc 
ggaatgcact 
ctcccagccc 
gctttactct 
tggaagcagg 
cggctgacaa 
acgattacaa 
ctacacggaa 
agcaggagca 
gagagaagag 
ctccaatccc 
tccaaggcct 
aggatcatcg 
ccaactcagg 
aaccagagat 
gctagaggta 
ggaaaaaaaa 



m 

^^rcttgatg 

cattcttatg 
tttttcccta 
ggagttgact 
gagaaaaagg 
tctccagcaa 
cttgggccct 
gcttgctgct 
tgcagaccct 
cctgcatgca 
tgctgacccc 
atcaacacgt 
caaggctcag 
aagggagcag 
ggaggagcgt 
agctctggct 
tgactctgca 
gactcccttt 
ccgtcaggca 
gacctgagag 
gatttggaag 
tgtggagctg 



tgctactttt 
catctgcttc 
ggttcagggt 
gtggggactc 
aataagaagg 
actcaagaag 
ttgctggatg 
tgccgagctg 
agagttctgt 
gcagctgcag 
actgtgcagg 
aatgagttcc 
gtgccaggac 
aaggcagccc 
gaaagagaag 
gcagagcgcc 
atcgtcaata 
cactacctcg 
gggaggccct 
ggcacattca 
atgggggtga 
gtactgtctc 



a^^aa 



PCT/US00/16636 



atctgggtga caaaataaaa 
taccctctga gtactcccta 
cggagggaga agatggcttt 



tggatctttg 
agaaaagccg 
aggagccttc 
aggccaaagc 
gagatgttgg 
ctctgctcag 
ctggaagagg 
actctcgggc 
gaaggttcat 
cattgacacc 
ggcggcaacg 
agcagcggcg 
gactcgctgc 



tgagtctgaa 
agaccaggag 
cacacagtca 
ccctggtcag 
agtgctaaag 
tgcccccttg 
ctcagtggtt 
ccggccacct 
ggagaagaat 
agaaatggag 
ggaggaacag 
atttgccgcc 
ccagttggga 



ctcgacgctg ctggagttgt 
acttctcttt ctgctccaca 
cttcctgatc tcttacagct 
cagcagccct aggttttttc 
aggacactcg ggaactaggg 
tggaatttta atcacaataa 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1880 



<210> 94 

<211> 879 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2013853CB1 



<400> 94 

ccctctccga 

gttacccggc 

ggtttaagaa 

tgccggagct 

gtgatggaga 

gcgacgagga 

acatttccaa 

actggcagct 

ggaagatatc 

tggtcctcat 

cctatctctt 

tttctttttg 

ttctccttct 

agtggggaaa 

accaaactag 



ccctttgagc 
ggagatgtgc 
gttggacttg 
gcgccacaac 
agtggacttc 
gcagaagttg 
cggggagctc 
ccagcagctg 
ctttgaggaa 
cgtatgagcc 
tcaagatttg 
tgaagccata 
ctgaatgact 
gaaaaggaaa 
aactttaaaa 



cgtggccgtt 
tcccactttg 
gacaaatcag 
ccgttggtgc 
aaggaattca 
aggtttgcgt 
ttccaggtgc 
gtcgacaaaa 
ttcagtgctg 
tttttcttac 
ctcaagacgt 
tgtcctaaca 
cagggtaccc 
tggcttttat 
gttcaaaaat 



gccagatgtc 
acaatgatga 
ggtctctgag 
ggcgagtgat 
tcctggggac 
tcagcattta 
tgaagatgat 
ccatcatcat 
tggtcagaga 
aagcaccacc 
ccaactgtct 
ggagcttcat 
tataggggga 
aaacatcttt 
aagaaagta 



cacaatggga 
aattaaaagg 
cgtggaggag 
cgacgtcttc 
ctcccagttc 
cgacatggat 
ggtgggcaac 
cctggacaag 
cctggagatc 
caacaacttc 
ctctgactta 
caccaactca 
agagcaagtc 
tactttgttt 



aacgaggcca 60 
c t gggc agga 120 
ttcatgtccc 180 
gacaccgacg 240 
agcgtcaagg 3 00 
aaaga t ggc t 3 60 
aacctgacgg 420 
gatggcgatg 480 
cacaagaagc 540 
tgctttcttc 600 
tctggaagta 660 
gtgctattaa 720 
aaatgagcat 780 
tgattcaaag 840 
879 



<210> 95 
<211> 3162 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2284925CB1 



<400> 95 

tgcgcatcgg 

agga aga tag 

atattgctcg 

ttgtaattga 

gggccacaaa 

accatgaaga 

agaatagcag 



gcccatgaga 
tcagagcgat 
gactccagag 
tcatagacaa 
tccctccgag 
ggcgtcagtc 
cataattgct 



ctgacgcagg 
ggcttctggt 
tcagcccttg 
actcagaact 
cacacggtga 
cttcctcttc 
tgctataatg 



accctattca 
gggcctgcga 
aatgctttct 
tcgatgcaga 
tcctcgcagt 
tccacgcagg 
aactgattca 



ggttttgctg 
cagagctggt 
tgataagcat 
agcagtgtgc 
ggtttcgcga 
tttcaacagg 
aatagaacat 



atctttgcaa 60 
tatagatgca 120 
catgaaatta 180 
aggtcgatcc 240 
gtatcggatg 3 00 
agatttatgg 3 60 
ggggaagttc 420 
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gctcccagtt 
aagccataga 
ggatgatggg 
gcgataagaa 
agtggcaggg 
agatcacccc 
tgtgttgtac 
attctcagac 
tgaaatcgat 
ccatggcgag 
tcaatgcagc 
agattttacg 
acaccagtga 
agtatgtgtt 
tcaatgatgt 
acatctttga 
tcttctctcg 
ggttccaagt 
ctgccgacgt 
tcgatcagtt 
cgggaaggac 
acactgctgt 
ccaaatgcaa 
ttattgacat 
tgaacagcat 
ctgctgggaa 
ctgacgtggc 
ctgaggagta 
cagtgtttga 
tcataacaga 
atttggctga 
ggcttccatc 
ctgtgaatca 
gtgaaggagc 
tagaaagtta 
aatacttggc 
ctggttcact 
cccaggatgg 
tcattgctgt 
gcttatctga 
taggaccaaa 
tggccgatgg 
actggtatca 
atccatgatg 
tctggctcac 
ttgaatgccc 



caaattacgg 
aataacaagc 
ctaccacaaa 
ccgggcagac 
ggtttactat 
agtgattggc 
cactgacaat 
agagcctcat 
atcatctcga 
gatccactcc 
ccaagaaaac 
gaccacagaa 
tcttgttgga 
tactaagaat 
tcccccttgt 
attggaagcc 
gtttggagta 
gatcgaagcc 
cctgcacgcc 
ggatgaggtg 
caactctttc 
tctggagagt 
cattttcaag 
ggttttggca 
caacaagcca 
gaacttccct 
caacccatgc 
ttttgcacag 
ccggaatacc 
catgtttgat 
caactacaaa 
tgacagctaa 
cagtagcgta 
taatgtttaa 
tgttccatga 
aaactccttt 
gtaactagca 
tgactaagta 
ttagcttgac 
acagatttat 
ttacagataa 
tatgacacct 
gcagcctgtg 
agggaaaaag 
agaggcagcc 
ttggacaagc 
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r tgtaatt 
gatgaccacg 
ggtgagctcc 
cttctcgaca 
gccagacgga 
caaggaggga 
aataagcaga 
tcattcagat 
ggcagtgatg 
atgaccatcg 
agcccagtca 
ctgtactccc 
ggcctgatga 
gtgcaccaga 
atctctcaat 
attacgcata 
tgtgagtttt 
aactaccact 
accgctttct 
gcagccctca 
ctctgcaatg 
caccacaccg 
aatattgaca 
acagagatga 
atggcagctg 
gaaaaccaaa 
cgccccttgg 
actgatgaag 
tgtagcatcc 
gcttgggatg 
cactggaaga 
agccaagcca 
aacaagaggc 
tatttgacct 
agaaaaatat 
gctctgctgt 
ggccacagga 
gcttagctag 
tgttttcctc 
ctaagaaaaa 
actagttagc 
aagttagaac 
taaccccttt 
tggcatttca 
acgaggcact 
ttttcttaaa 



cagtgtttac 
tgattcagta 
tgggaaaaga 
ccatcaatac 
aatccgggga 
aaattaggca 
ttcacaagat 
ataagaacag 
caccaagcct 
aggctcccat 
cagtagcgga 
ctcagctggg 
ctgacggctt 
gtcacagtca 
tacttgataa 
aaaggccatt 
taaactgttc 
cttccaatgc 
ttcttggaaa 
ttgctgccac 
caggcagtga 
ccctggcctt 
ggaaccatta 
caaaacactt 
agattgaagg 
tcctgatcaa 
acctgtgcat 
agaagagaca 
ccaagtctca 
cctttgcaca 
cactagatga 
cagagggggc 
cttcctttct 
tgaatcattc 
atgttctttt 
catcctgtgt 
agcaaagcct 
tgatcagctc 
aagaacatcg 
aaaaaaaaga 
ttcacagcct 
acagccttgg 
cctgtaaaag 
tttttgggga 
acaccaagta 
aaaaaaaaaa 



agcattagat 
tgtcaaccca 
actcgctgat 
atgcatcaag 
cagcatccaa 
ttttgtctcg 
t cat eg t gat 
gaggaaagag 
geagaategt 
cacaaaggtt 
agecttggae 
taccaaagat 
gagaagactg 
ecttgeaatg 
tgaggagagt 
ggtttatctg 
tgaaaccact 
ctaccacaac 
ggaaagagta 
agtccatgac 
gcttgctgtg 
ccagctcacg 
tegaaegctg 
tgaacatgtg 
cagegactgt 
aegcatgatg 
tgaatgggct 
gggactacct 
gatctctttc 
tctgccagcc 
cctaaagtgc 
ctcttgaccg 
aatgacaatg 
aagtccccaa 
gaatacttaa 
acccttgtca 
tggtgcctgt 
atcctttacc 
atctgaagga 
cataaaataa 
ctatggctac 
ctggtgggtg 
gggttcatct 
atccatgagc 
ttatataaaa 
aa 



cactgtcatg 
gecttcgaaa 
ctgcccaaaa 
aagggaaagg 
cagcacgtga 
ctcaagaaac 
tcaggagaca 
tccattgacg 
cgctatccgt 
ataaatataa 
agagttctag 
gaagatcccc 
tcaggaaacg 
ccaataacca 
tgggacttca 
ggcttaaagg 
cttcgggcct 
tccacccatg 
aagggaagee 
gtggatcacc 
ctctacaatg 
gtcaaggaca 
cgccaggcta 
aataagtttg 
gaatgeaace 
attaagtgtg 
gggaggatct 
gtggtgatgc 
attgactact 
ctgatgeaac 
aaaagtttga 
acaaaggaca 
acaggtattg 
atttcattct 
tgacagaaca 
atccatggag 
gagctcatct 
ataaaagtca 
ttcataagga 
gcgaaacaac 
atggttcttc 
ccctctctag 
taacaaagtc 
ttcctttatt 
gecattaaat 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3162 



<210> 96 

<211> 2740 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2376728CB1 



<400> 96 

cggaccttcg 

aagatggtgg 

aactgttttg 

ggagtattga 

gatattcttg 

attcccctgg 

tggctaatca 

tcagaatgga 

acacccagta 

cgttgtcaga 



ccttcgctgt 
ategcttgge 
gage age tgg 
ctaagttgtg 
tatatggcaa 
aaaatgtcac 
agacaccaac 
tgaatcatat 
atgaacatgc 
aagcaaaatt 



cgccgccgcc 
aaacagtgaa 
tcaaccttta 
caggaaaaag 
tattgtcatc 
tattgattcc 
taaatctttt 
aaataaatgt 
tgctgtctgg 
cacacctgtt 



gccgcccgcg 
gcaaatacta 
actatacctg 
cc caaagcaa 
cagaagaaaa 
atcaaagatg 
gcagtttatg 
gttactgatt 
gttcctgact 
aatcgtcgcc 



gaegtegggg 
gaegtataag 
gacgagttct 
ggcagttttt 
aatataacaa 
agggagactt 
ctgccactgc 
tactctccaa 
ctgaggcaac 
accattgccg 



ctattagtga 60 
tatagtggaa 12 0 
tattggagaa 180 
cttgtttaat 240 
acaacatatt 3 00 
aaggaatgga 3 60 
tacggagaaa 420 
aagtgggaag 480 
tgtatgtatg 540 
caaatgtggt 600 
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tttgttgtct 
cctgtgcgga 
cagcctgcta 
gatgatgatg 
tgtggctatc 
ttgttctagc 
agaaagtagg 
gcaaatatat 
ttgggaaaag 
taagtaaaac 
gttagcagct 
tgatttggaa 
ttacatgaaa 
tctttgaagt 
gacagctaca 
ataaaatagt 
tatattttaa 
tacagaattg 
attcttatgc 
agtctctttt 
agaaataata 
tcgtggaagt 
attttataaa 
tataatctgt 
taaattgtga 
ttttgaggca 
cactactctt 
cagtctaaag 
aatgtcatga 
ttggttaagg 
tctccctgta 
ttgcttttat 
tatatacata 
ggagatttgc 
ataacattta 
cttaaaacgc 



gtgggccctg 
tttgtgactt 
gatcagactc 
acgatgatga 
tgagaaatca 
catgaatttg 
tccccctgcc 
cagataaatt 
atgtttgttt 
taaaagcaca 
tttttctaac 
atactttggc 
taagccccta 
gataaaatga 
aataactaag 
tgtgattata 
attaataaaa 
tcatactgca 
tttaacatta 
ttacaaagcc 
tgtaagaatg 
attgtctatt 
gttttgtaaa 
atttgaaatc 
actttaagtc 
gccattaggt 
tccatttata 
tttgttttaa 
gaagtttgat 
cactaatttt 
ggaaatgtga 
gatttttttt 
aaatcaacca 
tggtacatag 
cattgtcaac 
aaaacgcagt 
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:tgaaaag 
ctgctatgac 
ttacagccag 
tagcagtgac 
actttggggg 
cctgagaaac 
ttctcccact 
ttttgtttct 
taacaggtca 
gaatggtggg 
catcctgtct 
tttttcatat 
ccttcttact 
gtgtttatga 
tttggaggta 
tttttgtttt 
gttgtcattc 
ttagcttcta 
tttctggatt 
tgtgttagag 
gcctcaaggc 
tcagtgtgaa 
atcccaaaca 
attttgcaat 
aatctagatt 
tgaaagtata 
tgctgcaaac 
attctaaatt 
tcagtaactt 
acttacctat 
aagaaaagca 
ggtttgtttt 
acatatctga 
gaatctagca 
ctttattgac 
tagaaaagtc 



agatttcttc 
ctgctttctg 
tcattgaagt 
taaggacaca 
aaatgtaaga 
ttgtaaccta 
cctcacactc 
tgtttatttt 
tgtactacgt 
aaaggggcta 
aatggttaag 
acctagtggt 
ttctggtttg 
atgggtgtaa 
ttttcactct 
atataggtcc 
ttaggaattt 
cctttagtaa 
gaaaatctta 
cacagattta 
agaccacttt 
actatcttga 
atatttctat 
ctatggaaat 
tattttgaga 
tatttatcat 
aactacagtc 
taaaaaatct 
gtgatggagg 
tagattttga 
caacaaaact 
aatatcaggt 
aaaggatcat 
aattcaggaa 
tttgttttta 
tagaaaaaaa 



ttcccagcca 
ctggggacat 
ctcctttaaa 
tttgggagta 
ttctgagctc 
tgtgcctcaa 
ttctacaggg 
taggttattt 
tgttgttttc 
taatgtggtt 
acaccagtaa 
gccttatcat 
ttgaaaaaat 
ttaggaaata 
atatgaataa 
caaattataa 
ggtttgaaat 
gacatatttt 
taaaaccctt 
cctaggcttg 
aagtttggct 
atttgcaaat 
ttttgtaaaa 
agagtagcaa 
agtaattgtt 
ataaaacttg 
tttgaaatat 
tcaaatctga 
attctttggt 
aagtatctga 
agggtttttt 
ggatttttgt 
gaaacctgag 
ccaaggggaa 
caataaaaaa 



gtcctctaag 
ggccacatgc 
tgatatgtct 
tttaatcagg 
tctctctgtt 
tatattccat 
ataggctttt 
tcttggaagg 
atttctgtta 
cattaataat 
caaaaacaca 
aatagcactg 
acactggtgc 
cttctcatct 
atatttttcc 
ttgtcaaata 
ttatcagtta 
ttaggtataa 
gaaaataaac 
aagatttgga 
agacttcata 
atagtgttat 
caattgtatg 
ttgctatttc 
cactctttac 
atgcgttttg 
ggaaaatcag 
atataccgca 
atcttactgt 
gatatacaaa 
gttcatttgc 
ttctaagcaa 
aaatgc taat 
atgttgtgag 
tattttacaa 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2740 



<210> 97 

<211> 1079 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc.feature 

<223> Incyte ID No: 2790762CB1 



<400> 97 

ccggcatgaa 

caataattga 

gcttgactaa 

gcatgttagc 

ttaagcgctt 

ggagaagcgg 

ccaccacctc 

gatggacagt 

tccaaaggat 

ccgcgcaccg 

aaggatgtaa 

ctgtacacag 

agtggggacg 

ggtgagagtg 

tatgcagtgc 

gcaggcgggt 

aatttcccct 

agagagattt 



gacagactcg 
ggcagtggtt 
gccgccagcg 
tgtgtagatt 
ttaatatgga 
tgatggagag 
ccctgcaaac 
ccgaactccc 
ccttctctaa 
cccggcacgc 
agattagcgt 
gagcagagcg 
tgcatgcttg 
ctgataagaa 
tcgatgagtt 
tcacggctaa 
acgaggtatg 
gggaattgca 



cttagtcgcc 
ctaaaagctg 
cacagcgcgg 
tatgtgaggg 
gacagatgag 
ccgagctcgc 
gtccagtggt 
tgctgaggac 
gggccgcctc 
acctgcggtc 
gagctttacc 
cgacgtgcgg 
tccctttggc 
ggatgaggag 
agaagatttt 
agcaatcgtt 
ttggcagc cc 
gcatcttttg 



agtcacttaa 
tctacattaa 
caggacgcgc 
cttgtaaaac 
agcccctctc 
cccttccaag 
gcagaggtaa 
cccttcaact 
ctcatagacc 
cggaagttct 
gagagctgca 
gcggagtgcg 
gggagtgttg 
aatgagctgg 
actgacaatt 
cagagagaca 
ctcctctaga 
aaagcaggga 



gctgagtgca 
tgaaaagagc 
ccgggtctca 
tctggtcttg 
cgctcccgtg 
cgctgccccg 
tggacgttgg 
tctacggagc 
cgaactgtag 
cccctgacct 
ggagtaagga 
gtctgctcct 
gtgacggggt 
atcaggaaaa 
tggagctaga 
gagtggatga 
gggctcttag 
aattaaaaaa 



ttgtgatttc 
aatgtggcca 
gcggacttgt 
taaactagtc 
tgggcccgca 
tgagcagtct 
ctctggtggt 
ttctcttctc 
tggccacagc 
taagttgctt 
caggaaggtg 
tagccctgtc 
aggcataggg 
gagagtggag 
tgaagaagga 
agaggccttg 
caaaacccaa 
aaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1079 



<210> 98 
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<211> 1393 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 28691 64CB1 

<400> 98 

ctcggccggc gctgacgcag ccatggcgga ggcggctttg gaagccgtgc ggagcgagtt 60 

acgagaattc ccggccgctg caagggagct ctgcgtgcct cttgctgtgc cctacctgga 120 

caaaccccca actccgctcc acttctaccg ggactgggtc tgccccaaca ggccgtgcat 180 

tatccgcaac gctctgcagc actggccggc cctccagaag tggtccctcc cctatttcag 240 

agccacagtg ggctccacag aggtgagtgt ggccgtgacc ccagatggtt acgcggatgc 3 00 

cgtgagaggg gatcgcttca tgatgccagc tgagcgccgc ctgcccctga gcttcgtgct 360 

ggatgtgctg gagggccggg cccagcaccc tggagtcctc tatgtgcaga agcagtgctc 420 

caacctgccc agcgagctgc cccagctgct gcctgatctg gaatcccatg tgccctgggc 480 

ctccgaggcc ctgggaaaga tgcccgatgc tgtgaacttc tggctggggg aggcggctgc 540 

agtgacttct ttgcacaagg accactatga gaacctctac tgcgtggtct caggagagaa 600 

gcatttcctg ttccatccgc ccagcgaccg gcccttcatc ccctatgagc tgtacacgcc 660 

ggcaacctac cagctaactg aagagggcac ctttaaggtg gtggatgaag aggccatgga 720 

gaaggtgccc tggatcccac tggacccctt ggcgccagac ctagcacggt accctagtta 780 

cagtcaggcc caggcccttc gctgcacggt gcgggccggt gagatgctct atctgccggc 840 

tctgtggttc caccacgtcc agcagtccca gggctgcatc gcagtgaatt tctggtatga 9 00 

catggaatac gacctcaagt atagttactt ccagctgctc gactccctca ccaaggcttc 960 

aggccttgac tgatggagca ctggtgaaca cgaccaagca cgcctcgggg gacggagcca 1020 

gcccctccct ggccaggtca attctcgaga gagcctggag tgtgcatgct ggctgctggc 1080 

cccgggtcca gcatggcttg agatcagctt tggaggatct tggaatgtgg tcataaggac 1140 

tcaaggtgcc aggcaggtct gggtgagggt tctcaggaag ttgccacaca ggtgagcaga 1200 

gtggggatca ggtgcagcgg cacctctccc cagcgctgtg atgttgggcg agtcactgcg 1260 

tctcgggcat tggtgtcctg tcagtaaaga gataataatg gctgtacctc gcggggctgt 132 0 

tgtgggcttg gagatgatgt ctatgaggac cagcatggag ctggcacaca ggacatgttg 1380 

aataaaaggt age 13 93 

<210> 99 

<211> 1580 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3317629CB1 

<400> 99 

ggcctccccg ctttgtgcta cctgcggccc ggctctcctc cgcgccccca ctcgcgctgc 60 
gatgacgege tcactgttca agggaaactt ttggagtgca gacatcctca gcaccatcgg 12 0 
ctatgacaac attatccaac atctgaacaa tggccgcaag aactgeaaag agtttgaaga 180 
ctttctaaaa gaaagggcag caattgaaga gaggtatggc aaagatctgc tcaacctctc 240 
taggaagaag ccgtgtggac agtctgaaat caacaccctg aagcgggccc ttgaagtctt 3 00 
caagcagcaa gtagacaatg tggcacaatg tcacattcag ettgeacaga gtttaagaga 360 
agaggecagg aagatggaag aattcaggga aaagcaaaaa ctacaacgaa aaaagacaga 420 
gctcataatg gatgetatec ataaacaaaa gagcttacaa ttcaagaaaa ccatggatgc 480 
aaagaagaac tatgagcaga aatgccggga caaagatgag gcagaacagg ccgtcagccg 540 
gagtgecaac ctggtgaacc cgaagcaaca agaaaagctt tttgtgaaac tggcaacttc 600 
aaagaccgea gtagaggact cagacaaagc atacatgetg cacatcggca ccctggataa 660 
ggtccgagaa gagtggcaga gtgagcacat caaggcctgc gaggcatttg aggctcaaga 720 
atgtgaacga ataaacttct teeggaatge attgtggtta catgtgaatc agctgtcaca 780 
acaatgtgtc accagtgatg aaatgtacga acaagtccga aagagtttag aaatgtgcag 840 
cattcagagg gacattgaat actttgtgaa teaaegcaaa actggacaga ttccaccagc 900 
acccatcatg tatgagaatt tctactcctc ccagaagaat gcagtcccag caggaaaggc 960 
tacagggect aacttggcaa ggagaggacc cctcccaatt cctaaaagct caccagatga 1020 
tcccaattac tctttggttg atgactacag tttgetctat cagtaaaatc aatgaaacca 1080 
gagctttttc cggctagtgc ttctgtgata tggaaagggc acccagagca gcaggaccta 1140 
tagecaegtt atgtcagcaa tgaagacttt gaagtgaacc ettgetataa ttttttagag 1200 
atttaaaatt tatggtagac atttaggaca acataagcaa gtagagttct gcagtttttt 12 60 
gaagtttaca aattgcccca ttctgaagaa ttattctttc ccagttactc aggttatgaa 1320 
tgaattaggt tttcaacatg ggaagcatga aatccacttc tggatttgga gcatccactt 1380 
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gaggagcaga ggtggcagca gSggattctg agccaccaac tgcagtagtg gctcctttgg 1440 

ctttgggcag cctggctgtg gagtttccac ggcgacacac agcctcagtg gtgcaagatt 1500 

taaaattacc ttcctttttg gctggaagac ttagaagccg cctgatcata ctttctcatt 1560 

ttacagatga ggaaataaac 1580 



<210> 100 
<211> 840 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 



3870488CB1 



<400> 100 

gttaaaaata 

gagcctgagg 

gtgcattgca 

aacaaacaaa 

agtcctgact 

aagacccagc 

caaagttgct 

tcaagtttga 

cacgtcgcgg 

agttcaccag 

ccagaacagc 

ccctggaaaa 

gccagccagc 

gagctgcctg 



caaaaatcag 
taggagaatc 
ctccagcctg 
aaacagctac 
gtccccagat 
gctcaggaag 
gaagcagctg 
aggggagatc 
gggtggcaag 
caatgaggag 
gctcctgccc 
ccaaccactc 
ccagcgcccg 
ggcttgtacc 



ccaggtgtgg 
gcttgaaccc 
ggtgacagag 
cagggtaaag 
gaagcgccct 
gagaaggatc 
aggaaggcag 
gtggttcaca 
cacctcggga 
atgctgtgcc 
ctggagacgg 
cccctgggac 
ctcacccagg 
tggccacata 



tgatggacac 
aggaggcgaa 
caagacttcg 
get gage egg 
cagttcagca 
cccagccaca 
agaaggcega 
cgaagatgat 
tccggcgcgg 
gggaccccaa 
aggtgtacga 
ggtaagaccg 
agecctggat 
aagccccagt 



ctgtaatccc 
ggttgcagtg 
tctcaaaaca 
cgggccctgc 
agccgccagg 
geagttgeca 
gagggagttc 
gatcgacccc 
ggagatcctg 
aggcaaatat 
tgatgtcgac 
gtaggcgtgg 
cccggcgcgg 
ttaaageaaa 



agetactegg 
agecgagatt 
aaaacacaaa 
ettgactegg 
aggccaccac 
cccatggacc 
eggaagaagt 
aacgetaaga 
gaggtgatcg 
ggctacgtgc 
ttctgegate 
ggecaggaca 
gaaagtcaca 
aaaaaaaaaa 
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<210> 101 
<211> 1970 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



3886318CB1 



<400> 101 

gaeggaggee 

agtgagcgtg 

egaeggggtc 

acccggaggc 

aageggegge 

cagcactccc 

gaggaggecg 

atcttcagca 

tggcccaggc 

gtgccaagta 

aggaccctgg 

aegageggge 

ageccgagga 

gctccttgct 

ccctggctgg 

tgtttgatgc 

actacttcta 

ctgacctgcc 

ccccctctgc 

ctctcaaggc 

ccccagctcc 

taggecaagg 

ctcccaggga 

aagctggggg 

agcagcagaa 

atctcttcaa 

gtgaggggee 



tggcactegg 
agttccgtgc 
ctggagccgc 
gcgggcgggg 

gggagcttcc 

tggcaggtca 
tccagcagat 
ggtagagcag 
caagattgag 
ccctgctcca 
ectgeagaga 
ectgeaggag 
egatgeagaa 
gctcttcaac 
tgctgtaaca 
ccccttgtcc 
tgtgccagac 
cggcattgcc 
ccctggcacc 
agacctacaa 
tgaggtgctg 
cgccaggcag 
agtggttgac 
categgcaag 
ggagcaggag 
caagctggtc 
eggaggagee 



aagacaaegg 
ccaggcccgc 
gctctgggag 
tgggeggegg 
gggagggegg 
gacctatgcc 
ggeggatgee 
agccggagcc 
aagatcaagg 
gagcgcctgc 
cgcccccgcc 
aagctgaagg 
gagggacttg 
accaccgaga 
aagacccatg 
atcagcaaga 
ctgggccagg 
aacgacctca 
attccagaac 
gatggggtac 
gccagtgcac 
gacgacagca 
ccctccggtg 
gccaagctgc 
caagtgagag 
atgaggegea 
tttgcccgcg 



atgggagccg 
gactcggtcc 
ggcgcacgga 
ggctaggacc 
ctcgcaggca 
gtgcccctca 
ctgcagtacc 
aggtgeagge 
gcagcaagaa 
aggaatatgg 
acaggatcca 
actttcctgt 
ggggtcttcc 
acctgtacaa 
tgatgctggg 
gagagcagct 
tgectgagat 
tgtacattgc 
tgcccacctt 
taacaccacc 
ccccactccc 
gcagcagcgc 
gccgggccac 
gcagcatgaa 
ccacgagcca 
agggcatctc 
tgtcagactc 



tgtgcacgtc 
accaggacag 
gggtgaacgg 

cagcggctcc 
ccatgactcc 
tccagccaga 
tgcagaaggt 
cattggagag 
ggccatcaag 
ctccatcttc 
gagcaagcac 
gtgcgtgagc 
cagcaacatc 
gaagtatgtc 
ggcagagaca 
ggaacagcag 
tgatgttcca 
cgacctgggc 
c cacao tgag 
cccaccgccc 
accctcaacc 
gtctccttca 
tetgetagag 
ggagcgaaag 
aggtgggcac 
tgggaaagga 
catccctcct 



gggagctegg 
cgctccgggt 
eggegttagg 
ggcagagegg 
tgtgaggatg 
cctgcggcga 
ctctggagac 
aaggtctcct 
gtgttctcca 
acgggcgccc 
cgccccctgg 
accaagccgg 
agctctgtca 
ttcctggacc 
gaggagaagc 
gtcccagaga 
tcctacctgc 
cccggcattg 
gtagecgage 
ccaccaccac 
gcggcccctg 
gtccagggag 
tccatccgcc 
ctggagaaga 
ttgatgtcgg 
cctggggctg 
ctgccgccac 
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cgcagcagcc acaggcagag gaggacgagg acgactggga atcctagggg gctccatgac 1680 

accttccccc ccagacccag acttgggccg ttgctctgac atggacacag ccaggacaag 1740 

ctgctcagac ctacttcctt gggagggggt gacggaacca gcactgtgtg gagaccagct 1800 

tcaaggagcg gaaggctggc ttgaggccac acagctgggg cggggacttc tgtctgcctg 1860 

tgctccatgg ggggacggct ccacccagcc tgcgccactg tgttcttctc ttaagaggct 1920 

tccagagaaa acggcacacc aatcaataaa gaactgagca gaaaaaaaaa 1970 

<210> 102 

<211> 1258 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 4043 934CB1 

<400> 102 

gaatttaata cgatcactat agggaatttg gccctcgagg ccaagaattc ggcacgagga 60 

tggggcaatg cttgaggtat cagatgcact gggaggacct ggaagagtac caggccctga 120 

ccttcctgac cagaaatgaa attctgtgca tccatgacac cttcctgaag ctctgccctc 180 

ctgggaagta ctacaaggag gcaacgctca ccatggacca ggtcagctcc ctgccagctc 240 

tgcgggtcaa ccctttcaga gaccgtatct gcagagtgtt ctcccacaaa ggcatgttct 3 00 

cctttgagga tgtgctgggc atggcatctg tgttcagcga gcaggcctgc ccaagcctga 3 60 

agattgagta tgcctttcgc atctatgatt ttaatgagaa tggcttcatt gatgaggagg 420 

atctgcagag gatcatcctg cgactgctga acagtgatga catgtctgag gacctcctga 480 

tggacctcac gaaccacgtc ctgagtgagt cggatctgga caatgacaac atgctgtcct 540 

tctcagagtt tgaacatgca atggccaagt ctccagattt catgtactcc tttcggattc 600 

gcttctgggg atgctgatgt agcggcaaat acctgacatg gcagcctcga gggagaccac 660 

aggaatcgaa ccccctccag cactggaggg agctggtttg aagtgtgact ttgtactggg 720 

cccacactca cctctagaat attgtttatt agataaaaga aaaagctttt ccttagccca 780 

tcagatcatc gctttttaaa tgcagggtca tacatggtac tttttattaa gaactgccct 84 0 

ttccagggct tcagtgtgcc agcgatgtca agcaggctgg ggtggcaatc tttctgaggg 900 

aatagttcaa atctcaaccc atgtcatagc agggggccaa gccaaatggg atgaaggtcc 960 

ctagcaagat acatgtcctt ccctcccttc atcaaaaccc ccgaccccca gcacctcaca 1020 

gttcacagct gcacagagac gtgcacatag cagccattcc agccggtgcc cggctcccca 1080 

ctccccttct gagggagaag gcatcttggc cctgattgtg ggaccagcaa ttagagttta 1140 

tttgctttgc tat t cage ca agctctggaa tgaatttggc caccacagac agcctttggg 1200 

gtccagctcc tcagggactt tgagacacag aagaagcccc ccattgggtt tttcttat 1258 

<210> 103 
<211> 685 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 4371445CB1 

<400> 103 

tgagacctgt ctaaatttgg tttgtgcctt aattgagtta gtcattgaac ttccaaaaat 60 

gttgetgect gaagaggaat ttcagtaacg ttcttcatgt ttacaataat tttcccagtt 120 

tgtaaaaaca gtctaaatat tttctgaaaa ttgaaaaact tgaaaaacta aaaaattacg 180 

ttgctgtaaa ctctgtgggc ggtgtaaaaa gtttagttgt ggaaaaaata catcattatt 240 

ttctttgtct gacatagaag actcagcctg gatttcaaaa ctgaacactg aagtggtttg 3 00 

caacagttac ctggctcttg gcctcaatcc agctggatca aactgagacc tgaaaagaga 3 60 

tattgactcc tgtggcagtg gaaagaccca agaatgecag tcaagaagac agatactgac 42 0 

cgagctttat cattactgga agaatactgc aaaaaattaa ggaaaccaga ggagcagctg 480 

ttgaaaaatg cggttaaaaa ggtgatgggt atctttaaga gcagcttatt ccaagctttg 54 0 

ctaggtatgt attatgaaag ttattcatca ttttaatggt cagaattggg cttgatagtt 600 

tgaataatgt ggtcaaagaa cctgatgtta accaagtttc aaactgggat aattttcttt 660 

aatactggaa gggaagattt gtttt 685 

<210> 104 

<211> 1886 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 5527925CB1 

<400> 104 

tgctgcgctt ccgcaaagat ggcggcggct 
ggcggaaaac tgcgacgggg gctcgctgtc 
ttggcagcgg cagtggccgg cgcggccctg 
cgcgtcagtg ttgcggcgcg ggatggcagt 
catggaataa tatatattgg gaaaccgtct 
tcactcgaac atgaaggaga atattatatg 
tttgagcaaa tggaacgtaa aacttcagtc 
acactgtcag ggatccaaac agctggctgt 
aaagggctaa tttcatatac cgagtatctt 
tctggatttc atgttgcttt taaaatgctg 
agggaatttt ttaagctgca gaagatcata 
actaatgaaa ctggatatca ggaagcaata 
cagatgcgtt tctttggaaa aagaggacaa 
tttatggaaa atttacaaac agagattcaa 
ttgagtttca tgagaaaaga agactttgca 
aataaagata tttattggaa aaatgtgaga 
ttggatgaat tcaagtcatt ttgccatttt 
atgcagatgt tcagtttagc tcatcgtcct 
aaagtagcaa caggacaaga actctcaaac 
gatttggatg gtgatgaatg tcttagtcat 
atgcatcgag gtttatgggt accacaacat 
aagaaagaaa gcattaaagg agtaaaagaa 
taataaaaga tataatagta tggcaattat 
ttagaagtac ttgctattta tcttcttaag 
tgtcttgtac ttgctttctg attcataatt 
agtataaaaa tagatattgg attctgtcag 
ttcggtaaga tggaaaactt agtgattcac 
tttctggaag gataaaacaa atacatatgg 
aattccagac atcttcatgt tcctaatagg 
tttctcatct gtaccttata tctcataaat 
tgtttattgt ctttaaaata acatgtaaac 
tatttacata gatacaaaaa aaaaaa 



gcgggtagct gcgcgcgggt ggcggcctgg 60 
agccgacagg ctgtgcggag tcccggcccc 120 
gcaggagcag gagcggcctg gcaccacagc 180 
tttacagtct ccgcacagaa aaatgttgaa 240 
cttcgtaagc agcgcttcat gcagttttct 3 00 
acaccacgag acttcctctt ctcagtgatg 3 60 
aagaagctga caaaaaagga catcgaggat 420 
ggatcaactt ttttcagaga ccttggcgat 480 
ttcttgctta caatcctcac taaaccccat 540 
gatacagatg gtaatgagat gattgaaaaa 600 
agtaaacaag atgacttgat gacagtgaaa 660 
gtgaaagaac ctgaaattaa cacaactctt 720 
agaaaacttc attataaaga atttcgaaga 780 
gaaatggaat tccttcagtt ttctaaaggt 840 
gagtggctac tttttttcac taacactgaa 900 
gagaagttgt cagcaggaga gagcattagt 960 
acaacccact tggaagactt tgctattgcc 1020 
gtcagactag cggagtttaa gagagctgtg 1080 
aatattttgg acactgtctt taagatcttt 1140 
gaagagtttc ttggggtgtt aaaaaacaga 1200 
cagagtatac aagaatactg gaagtgtgtg 1260 
gtctggaaac aagctggaaa aggtcttttt 1320 
attgttccaa atgtcaaaat ttgtgatttt 1380 
tcttcattga tattctgtgt gaaataagca 1440 
ttatgaaaga acttagtaga aagaaaagta 1500 
aaggcctaga tttgaaataa tgttttgtac 1560 
tgatttctta gacactctaa tatgatatgc 1620 
gaaaaagtac ttgagaccaa ggccagcatc 1680 
ctaaatgaag ttaaaaactt atttcagatt 1740 
ttattgcata ttttatgtca gtagcttagc 1800 
ttcaatgttc tatctggaag cagaataaaa 1860 

1886 
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